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ABSTRACT 


The N- and C-terminal telopeptides of the a-2 chain of type | collagen in 
CD30H/H20 solutions have been studied using high-field 1H and 1SC NMR. The 
conformation of the N-telopeptide (nonamer) is predominantly extended with a 
small proportion of the molecules existing in a type | B turn. The four residues 
involved in this turn are D3-A4-K5-G§ which is stabilized by a C = O(D3) - NH(G§) 
hydrogen bond. The C-terminal telopeptide is extended throughout. A model is 
proposed involving charge-charge and hydrophobic interactions between the 
extended a-2 chain N-telopeptide and the adjacent segments of the triple-helix. 
A similar model is proposed for the C-telopeptide. 

The conformations of the type Il and type Ill collagen a-1 chain 
C-telopeptides in CD30H/H»5O (80/20) solution have been studied by means of 
two-dimensional proton NMR and circular dichroism spectroscopy. The 
conformation of the type II C-telopeptide is mostly extended. Evidence from CD 
spectroscopy suggests that a minor proportion of the peptide might be helical 
(ca 8%). The conformation of the type IIl C-telopeptide is mostly extended 
except for a f-turn ranging from Gly® to Glu!, which is stabilized by a hydrogen- 
bond between the NH group of Glu'! and the carbonyl group of Gly8. 

The importance of side-chain charge interactions in the formation of B- 
turns has been examined. Sixteen protected NAc-tetrapeptide amides were 
studied. The results indicate that a small proportion of type | f-turn exists in 
solutions of DEKS and DERS in methanol/water (60/40), while NEKS has an 
even smaller population of this turn. The other tetrapeptides are present in 
solution only in the extended conformation. These results clearly show the 
importance of the salt-bridge between the side-chains of K2 and ES or R@ and 


ES as well as the importance of the charge on the side-chain of the first residue 
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in stabilizing the B-turn. The relevance of statistical predictions for ®-turns in 
short peptides is also discussed. 

The solution conformations of DArg9-[Hyp3, Thi?, DCpg’, Cpg8}]-BK and 
DArg9-[Hyp’, DCpg’, Cpg8]-BK, two extremely potent bradykinin antagonists 
and two related antagonists which differ from each other only by one or two 
residues, were investigated by 1H NMR in a methanol/water (80/20 v/v) solution. 
A turn-like structure, which coexists with the extended conformation, was 
observed between residues 2 and 5 for the most active antagonists, in direct 
correlation with the peptide activities. The data suggest that a turn-like structure 
between residues 2 and 5 could be important for antagonist activity and that the 
conformational difference between bradykinin agonists and antagonists is due to 
a turn formed at different positions in the peptides. 

Finally, the aggregation properties for two bradykinin analogs, namely 
DArg-[Hyp3, Thi?, DSer®, DCpg’, Cpg8}-BK [I] and DArg-[Hyp’, DSer®, DCpg’, 
Cpg8]-BK [II] were investigated by 1H NMR and CD75. These two molecules 
have exhibited an abnormal, non-linear temperature dependence for the amide 
NH proton of Cpg8. The NH of Arg? also shows a slightly non-linear 
temperature dependence at temperatures above 25 °C. In addition, a very slow 
exchange rate for the NH protons of DCpg’, Cpg8 and Arg9 indicated 


aggregation of these two analogs which was confirmed using circular dichroism. 
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CHAPTER 1 


Introduction 


Collagens and Telopeptides 

Collagens are a family of chemically and structurally related, but 
genetically distinct, proteins that are found in the connective tissues of all 
multicellular organisms. The association of collagen fibers into bundles 
determines the architectural organization of the connective tissues and 
conditions their mechanical properties!:;*. Of the fourteen or more collagen 
types that are found in higher vertebrates, type I, type II and type Ill, i.e., the 
fibril-forming interstitial collagens, are the most common ones and are axially 
aligned in staggered arrays to produce the bundles (fibrils) that comprise a major 
constituent of many structurally important biological materials, such as tendon, 
skin, and bone. 

Many physical and physiochemical studies, including direct visualization 
of individual molecules in the electron microscope, have demonstrated the rod- 
like nature of collagen molecules. Molecules of collagen have a length of slightly 
less than 300 nm and a diameter of about 1.4 nm. The rod is neither rigid nor 
randomly flexible but appears to possess an intermediate level of flexibility which 
probably varies along its length. Collagen monomer is characterized structurally 
by a triple-helix which consists of three polypeptide chains (a chains), each a 
little over 1000 amino acid residues long. A prerequisite for the formation of this 
triple helix is a Gly-X-Y repeating tripeptide unit in the amino acid sequence of 
the three chains, where X and Y can be any amino acids but are often the imino 
acids proline and hydroxyproline. This sequence, with glycine in every third 


position and with an unusual abundance of hydroxyproline, forms the basis for 
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the chemical identification of collagen. The details of the three dimensional 
structure of the triple helix, established by X-ray diffraction and using known 
bond lengths and bond angles, have been reviewed’. At each end of the a- 
chains are short sequences without the repeating G-X-Y unit, called 
"telopeptides", which do not adopt the triple-helical conformation. Telopeptides 
constitute less than 4% of the complete collagen molecule, but contain the only 
tyrosyl residues of the molecule and the lysine residues, one in each telopeptide, 
which provide the potential sites for the formation of covalent cross links4. The 
formation of these covalent cross-links is the final step in the production of the 
stable functional unit of collagen - the fibrilS. Inhibition of cross-link formation is 
cited as part of the underlying pathology in several connective tissue 
diseases*:®, Conversely, there is evidence that increased cross-linking is 
responsible for detrimental changes in the mechanical properties of fibrils during 
aging’. The telopeptides are sites of the major antigenic determinants of the 
molecule; they are the only sites for attack by any proteinases other than triple 
helix-specific collagenase. An outline of a basic collagen monomer structure is 
shown in Figure 1.1a. 

Type | is the most abundant form of collagen and found as relatively thick 
fibrils, approximately 20 to 200 nm or more in diameter, depending on the tissue 
and age®.9. When organized into fibres and fibre bundles, they give mechanical 
strength to tissues such as dermis, tendon, bone and dentine. The three a- 
chains are non-identical in type | collagen, where the monomer comprises two 
identical a1(l) chains and a nonidentical a2(l) chain. The amino acid sequences 
of the a1(l) and a2(l) chains are known!9:11, Species differences occur, but a 
substantial measure of homology between species exists for type | collagen! 
Of 1050 amino acids in the a1 chain of calf skin type | collagen, 1014 occur in 


the repeating Gly-X-Y triple helix. The N-terminal telopeptide has 16 residues 
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and the C-terminal telopeptide 25 residues. The a2(l) chain has 1029 amino 
acids and telopeptides are 9 and 6 residues long for N-terminal and C-terminal, 
respectively. (Figure 1.1b). 

Type Il has three identical a-chains. Whereas the sequences of type | 
collagen telopeptides are highly conserved between species, there is 
considerable variability between the telopeptides of different genetic types of 
collagen types. Type II collagen is found in adults only in hyaline cartilage, 
where it is the dominant form, and in vitreous humour. /n vivo, it is present as 
thin fibrils which are critical components in the physical integrity of cartilage. The 
N- and C- telopeptides of human type II collagen are 19 and 27 residues long, 
respectively!S (Figure 1.1c). Type Ill is the second most abundant form of 
collagen in soft tissues other than cartilage. It is prevalent in tissues where 
distensibility is a major requirement for function, such as blood vessel walls, lung 
parenchyma, intestine and uterine wall'4. /n vitro, type Ill collagen has been 
shown to form thinner fibrils than type | and to reduce the diameter of 
copolymeric fibrils containing predominantly type |15. /n vivo, it appears that the 
fibrils in many tissues contain both types | and Ill, but at different ratios in 
different organs and tissues!:'©, The type III collagen monomer consists of 
three identical a-1 chains. Sequence data are available for the bovine N- 
telopeptide, which is 14 residues long!” and the human’ and chick’? C- 
telopeptides which are 25 and 23 residues long, respectively. (Figure 1.1d) 

With the aim of understanding how the mechanical properties of 
connective tissues can be related to the underlying molecular structures of 
collagen, collagen fibrillogenesis, the process of fiber assembly, has been 
intensively studied in hopes of elucidating the mechanism of this important step 
in the biogenesis of connective tissues. Most in vitro investigations have 


centered around the observation that collagen molecules have the ability to self- 
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assemble from solution, when the pH, temperature and ionic strength are 
adjusted to physiological values®922, The fibrils produced in vitro are 
indistinguishable from those found in tissues?3-25_ It is now generally accepted 
that all the basic information needed to form the fibril is clearly contained in the 
collagen amino acid sequence and in the intrinsic properties of collagen per se, 
although other factors and other components of the extracellular milieu do play 
an important role in the process in vivo!2, 

In vitro studies of the self-assembly of collagen fibrils have supported the 
concept?% of the initial formation of "nuclei" which appear to be end-overlap 
structures®5:27, The formation of "nuclei", or initial aggregates, is followed by 
lengthwise growth of the microfibril and subsequent lateral assembly into 
fibrils?®.29, One aspect of the self-assembly process which was recognized very 
early was that proteolytic degradation of the telopeptides of collagen, without 
modifying the triple helical domain, would lengthen the fibril initiation and growth 
phases, or completely abolish the ability of the collagen molecules to self- 
assemble28,30-32, The addition of free amino-terminal telopeptide, isolated from 
the a1(l) chain, to a solution of native collagen, specifically enhanced the rate of 
fibril formationS3. From these results it has been deduced that the N-terminal 
telopeptide of «1(I) chain directs linear growth of type | collagen fibril8° while the 
C-telopeptide promotes both linear and lateral growth? 34, 

The functions of the telopeptides presumably depend on _ specific 
interactions between the telopeptides and the triple helices of collagen 
monomers adjacent to them in the growing fibrils. Because such interactions are 
likely to be strongly conformation dependent, it is now evident that a detailed 
knowledge of the conformations and interaction behavior of the telopeptides is 
an essential step toward a better understanding of many aspects of the fibril 


assembly process8. Since the telopeptides are very small compared to the 
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triple-helix and probably lack a well-defined periodic structure (see below), X-ray 
diffraction is unable to furnish any high resolution structure data; and electron 
microscopy can only indicate that the telopeptides in the fibril are in a condensed 
or folded conformation’. However, several attempts have been made to predict 
secondary structures from the amino acid sequences of telopeptides, and 
subsequently models for interaction between telopeptides and triple helices have 
been proposed?%>39, One common feature of these predictions is that a loop 
exists around the lysyl residue which is involved in the intermolecular cross-link, 
but in most cases the predictions even from the application of the same 
algorithm, have often varied between laboratories and physical evidence 
available has, until recently, been insufficiently detailed to support or refute the 
various models proposed. 

In this laboratory, the conformations of the telopeptides in solution have 
been studied using modern techniques of high-field NMR spectroscopy to 
provide more physical evidence and a better understanding of fibril assembly of 
collagens. So far, investigations on all the telopeptides in type |, Il and Ill 
collagens40-46 have been completed. 

In Chapter 2, the high-field 1H and 1SC NMR studies of the N- and C- 
terminal telopeptides of the a-2 chain of type | collagen in CD30H/H20 solutions 
are described. 

Chapter 3 presents the conformational studies of the type Il and type Ill 
collagen a-1 chain C-telopeptides in CD30H/H2O (80/20) solution by means of 
two-dimensional proton NMR and circular dichroism spectroscopy. 

Chapter 4 discusses the importance of side-chain charge interactions in 
the formation of f-turns. Sixteen protected NAc-tetrapeptide amides were 
studied, namely the variants of DEKS; NEKS, EEKS, DDKS, DQKS, NQKS, 
DERS, NERS, EERS, DDRS, NDRS, DQRS, and DKES. Three tetrapeptides, 
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NPDM, NSDM and NDDS, were also studied as they have a high probability of 


forming B-turns, based on statistical predictions. 


Bradykinin and its Analogs 

Bradykinin, Arg-Pro-Pro-Gly-Phe-Ser-Pro-Phe-Arg, which may _ be 
considered as a prototype for the kinins and is naturally present in human body 
fluids (blood and urine), is known to possess extraordinarily pharmacological 
activities such as vasodilation and algesic activities4’-49. It is one of the most 
potent vasodilators and causes dilation of blood vessels in muscle, kidney, 
viscera and various glands and also in the heart and brain5°, Various smooth 
muscle preparations are found to contract in response to bradykinin. It is also a 
powerful algesic agent and causes an intense burning pain when applied to the 
exposed base of a blister, and a throbbing, burning pain in the hand when 
injected into the brachial artery. Recently, it has been suggested that bradykinin 
may be associated with the symptoms of the common cold91.52, 

In order to investigate the role that bradykinin plays in a variety of 
pathophysiological processes, the preparation of effective competitive 
antagonists is essential for the study of the actions of molecules involved in 
biological communication as in the case of bradykinin. Bradykinin can produce 
effects on its receptor which makes accurate evaluation difficult, if not 
impossible. In addition, rapid destruction of agonists causes serious problems in 
the estimation of their presence in biological systems and of their potencies in 
assays. Properly designed antagonists can overcome these problems®9, and 
the design and synthesis of potent, stable and specific bradykinin antagonists 
has long been considered a desirable goal in medicinal chemistry. Compounds 
with these characteristics could be important in the treatment of such diverse 


disorders as septic shock, asthma and rhinitis#9.54. Because of the fact that 
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neither X-ray crystallographic nor NMR data pertaining to the receptor or ligand- 
receptor complex is available, many analogs of bradykinin have been 
synthesized in the period since the announcement of the structure of bradykinin 
in 1960, in order to increase the potency, selectivity and lifetime of action of 
bradykinin analogs and to obtain insight about their bioactive conformation. In 
almost all of these syntheses, analogs of natural amino acids were used to 
impose specific conformational constraints5>.56, 

The first report of bradykinin analogs able to antagonize the action of 
bradykinin in the standard kinin assay system came in 1985, with the description 
of [DPhe’]-BK (BK = bradykinin, [DPhe’]-BK = Arg-Pro-Pro-Gly-Phe-Ser-DPhe- 
Phe-Arg)°’. Before this, only agonists were obtained. Structures and biological 
activities of several hundred agonists were summarized in the reviews of 
SchroderS8 and StewartS9. The critical change conferring bradykinin antagonist 
activity upon analogs was the replacement of the Pro at position 7 with a D- 
aromatic amino acid residue. Acting on the assumption that modifications 
previously found on agonists to enhance receptor affinity might increase the 
potency of [DPhe’]-BK, a weak antagonist, Thi (Thi = f-(2-thienyl)-L-alanine) 
was placed in positions 5 and 8, Hyp (Hyp = trans-4-hydroxy-L-proline) was 
placed in positions 2 and 3, and basic residues or dipeptides were added to the 
amino end of the moleculeS3’. These manipulations were successful in 
converting the weak partial antagonist [DPhe’]-BK to a much more potent 
analog with antagonist activity on all the standard assay tissues®062, Among 
these early antagonists, DArg-[Hyp, DPhe’]-BK and DArg-[Hyp3, Thi?.8, 
DPhe’]-BK were the best. Now DArg-[Hyp3, DPhe’]-BK is in clinical trials for 
use as an antibradykinin drug in several pathological states. 

The discovery of the first bradykinin antagonist established the necessary 


and, at times, sufficient requirement that a D-aromatic amino acid residue 
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replace the Pro residue at sequence position 75357, Additionally, the 
combination of a D-aromatic residue at position 7 with an L-Pro residue replacing 
the L-Phe at position 8 was also shown to produce antagonists®3, Recently, 
bradykinin analogs with a modified D-Phe residue at position 7 combined with 
Pro derivatives at position 8 were demonstrated to be the most potent and long 
acting antagonists®4. So far, bradykinin analogs having bulky, beta-branched or 
cyclic D-aliphatic residues at position 7 with bulky, or cyclic L-aliphatic residues 
at position 8, such as DArg9-[Hyp’, ThiS, DCpg’, Cpg8]-BK (Cpg = a- 
cyclopentyl-glycine, see Figure 5.1 for the primary structure), have yielded the 
best antagonists®5.66, as have bradykinin antagonist dimers®’”. Thus, both the 
“conventional wisdom" that a D-aromatic residue is required at position 7 of BK 
antagonists, and structure-activity relationships leading to models for receptor 
interactions based on aromatic side chains at position 7 and/or position 8, must 
be altered®5, 

The different biological activities of bradykinin analogs are believed to be 
conformation dependent. In the absence of a known receptor structure and a 
very small binding constant between bradykinin analogs and their receptors, the 
large number of bradykinin analogs, together with their structural and biological 
activities, provides a solid foundation for gaining insight into a possible bioactive 
conformation in solution. In various studies including 1H NMR in aqueous 
solutions, bradykinin was found to exist in many conformational non-regular 
states®8, However, bradykinin in less polar solvents such as dioxane®9 and SDS 
(sodium dodecyl sulfate) micelles’? was found to possess a well-defined 
conformation characterized by a B-turn-like structure for residues 6 to 9. Earlier 
conformational studies in solution®®:©9 on bradykinin itself and an antagonist also 
led to the hypothesis that the difference between an agonist and an antagonist is 


related to the type of f-turn adopted at the C-terminus, together with the required 


: be += fhe 
wieoaiab I & “1 


oniteiqe? eubieat 1 1 rie 
a4 . } 


7. | 7 
AAG 


ces 
4 
Fats Ww 


an eoeeupee’ is suites) wt 


AAs oe tr fs Busta 295 3 semmosee-Ch alton ‘ 


Aras =@dainedeton eoubow, oP wale ies enw .£ rotROg | wi 
meen tins a giizen 6 6 hig att behibam # Citw enctens: 1 
oviel breed ‘le sen act uti oF baial nae Ba 2 mien) ie 
1 tyerteh- — vittuc 5 fF ' Werk. at 2 ’ 
Ai shed ghiaet O19)2 20 ad PP ere as e bis orn 
eit bablery oved (erhuile yiemid.a 14 Fup t 22 eriovig ae mo 
ee Hod | etna nirtiybond ome 5 ‘A s13inoge s Yes 
Mel yo eLel l uh ‘ J fart? “nvabselvy 1 
il fo if Pf anf) OSS Syearcta @ Vivilse ann: uz tie., 
tower ry ol } 200 76 eno =o BIaTwis fa pees 
al iba pits Pon ‘art wn gnu, > ia npoload nevahe + at 
e bog eiufouvte lotgsoer nwot earentedte ata wnebnegen: <—_ | 
wf soineses dient) Drs spy GE uri tuba Fie ated wnrstesine —— 
 Jasipolott bis te | Hiweeripgot egoions seiacaehe 
BUUSRO 20a a Oa F ; aay on Wg aol Powaon: ct poe ae = : 
suomids 7 AMU..4! onfuyien salints: munity nt: vai 


isicne nen Tsepiqrenin 


ae brs Veimsxoiy So 110 


bey Wet ew S htcinin a 


sothest gard eon 


“ 


¥ 


a , 
oa bo oad) wy 6 
eS ae < a 


iow 


- 


) ‘4 
28 Yi | 7 
J 


7 wt ert va oil 
bale ea yeincgsne Ts bit weat i me PT nat it 


res at a . 


+ be sot one rete 
1 Rane vate zeai ru nisnihyon 
6} ape schieoa a 


or fee hi oo 


wy 


ay. 


; a colt - 


ae it 


af 


orientation of the side chains’9°. To challenge this hypothesis, Kyle et al.68,71 
designed and prepared several constrained bradykinin analogs, which were 
assumed to have an inherently stabilized B-turn geometry at their C-terminus 
based on conformational analysis using empirical calculations. Two of them with 
big bulky side-chains at position 7 or 8, such as DArg9-Arg!-Pro2-Hyp3-Gly4- 
Thi9-Ser8-DTic’-Tic8-Arg9 (Tic = tetrahydroisoquinoline carboxylic acid 
residue), were found to be competitive antagonists, but NMR data were not 
reported for them. More recently, two bradykinin analogs containing a-MePro at 
position 3 or position 7 were reported to show reversed-turn conformations at 
both Pro?-Phe? and Ser6-Arg9, respectively, in aqueous solution by NMR, but 
no data on activity were reported for these molecules”. 

In chapter 5, the solution conformations of [DArg2, Hyp3, Thi9, DCpg’, 
Cpg§]-BK and [DArg9, Hyp3, DCpg’, Cpg8}-BK, two extremely potent bradykinin 
antagonists and two related antagonists which differ from each other only by one 
or two residues, were investigated by 1H NMR, in order to provide more 
experimental evidence for the hypothesis that a f-turn in the four C-terminal 
amino acid residues of bradykinin analogs might be a prerequisite for high 
potency®8. This study was carried out on the antagonists in the free state, as it 
is impossible to carry out NMR studies when they are bound to the receptor. 
These antagonists function at nanomolar concentrations, far below the limit of 
NMR detectability. The solvent for this study is a methanol/water (80/20 v/v) 
mixture, which is a structure-inducing solvent that has been used in the study of 
other small peptides4®. 

In chapter 6, the 1H NMR and CD studies of the aggregation properties 
for two bradykinin analogs, namely DArg-[Hyp3, ThiS, DSer®, DCpg’, Cpg§]-BK 
[I] and DArg-[Hyp3, DSer§, DCpg’, Cpg®]-BK [Il] (Cpg = a-cyclopentyl-glycine; 
Hyp = 4-hydroxy-L-proline, Thi = (3-(2-thienyl)-L-alanine), were described. 
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Secondary Structures in Small Peptides 

A certain conformation of a peptide can be described by a set of dihedral 
angles as indicated in Figure 1.2, since all of the bond lengths and bond angles 
are invariant, except perhaps for t, the backbone N-C@-C' angle, which is usually 
110°, but could stretch to larger values in order to accommodate other strains in 
the structure’3, The dihedral angle w at the peptide is very close to 180° and the 
C@(i) and C@(i+1) are all trans to each other, because the peptide bond has 
partial (40%) double-bonded character. Cis peptide bonds, with w=0°, can occur 
perhaps 25% of the time before proline residues but essentially rarely before any 
other residues. The proline ring is not quite flat, and occasionally protein 
structures are not being refined accurately enough to determine the direction of 
ring pucker. Considering the fact that dihedral angles for side chains are more 
difficult to define because of the greater flexibility of side chains, dihedral angles, 
o and w are the source of essentially all the interesting variability in peptide 
conformation. Actually, secondary structures are usually refered to as regular 
arrangements of the backbone of the polypeptide chain without reference to the 
side chain types or conformations’4:9, 

The value of each particular dihedral angle adopted by a residue in a 
molecule depends upon the overall energy of the molecule, which depends on 
bond angles and length, on the interactions between atoms not linked by 
covalent bonds and on geometrical restraints’4"”©. The permitted values of > 
and w are generally indicated on a two-dimensional plot of -1 angles known as 
the Ramachandran plot’’. In a typical Ramachandran plot for a nonglycine, only 
8% to 20% of the area in the plot is sterically allowed’375, 

Many peptides have been found to adopt a few regular secondary 


structures that are also found in natural proteins. Although these regular 
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secondary structures in peptides and in proteins are homologous structures, the 
difference in molecular sizes leads to some distinct geometric and energetic 
characteristics for the regular secondary structures in peptides®>. First, nearly 
all atoms in peptides are exposed to solvent to some degree; there is no sharp 
distinction between surface and interior groups. Secondly, peptides adopt 
conformations that are necessarily determined by short-range interactions. 
Finally, they are often conformationally mobile and their preferred conformations 
are strongly influenced by interactions with other molecules and by the nature of 
their environment. Although these characteristics of linear peptides make them 
difficult targets of conformation analysis, their intrinsic flexibility makes them 
sensitive indicators of the relative stabilities of various conformational states. 
The observation of one conformation for a linear peptide in solution or in crystals 
provides evidence for the stability of that conformation relative to other 
conformational states which are potentially available. 

The conformations of linear peptides have been of interest to 
theoreticians and experimentalists for many years. The idea that secondary 
structures have functions related to their structural characteristics appears 
throughout the literatureo5.56, Recently, the discovery that peptide fragments of 
proteins in solution, especially in water, do, in many cases, exhibit significant 
conformational preferences for recognizable structures’®-85 was of great interest 
and significance in several fields, notably those that sought insights into the 
earliest events that initiate protein folding®®&99 and into the likely mechanism of 
induction of anti-peptide antibodies9". 

In order of the frequency in which they appear in peptides, regular 


secondary structures of peptides can be classified into the following classes: 
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Reverse Turns and Nascent Helices 

Reverse turns (also known as tight turns, 8 bends, hairpin bends, etc.) are 
the simplest regular conformation formed by linear peptides. A turn is defined as 
a site where the polypeptide chain reverses its overall direction. In proteins, their 
frequent occurrence, accounting for about one-third of the total molecule’3, is 
responsible for the globularity of globular proteins. Turns are intrinsically polar 
structures with backbone groups that pack together closely and side chains that 
project outward. Such an array of atoms may constitute a site for molecular 
recognition, and indeed, the literature abounds with suggestions that turns serve 
as loci for receptor binding, antibody recognition, and _ posttranslational 
modificationS®:56, In addition, reverse turns probably play a key role in the 
initiation of protein folding and have been proposed as pivotal intermediates in 
the formation and denaturation of helices. 86,88,90 

Turns found in peptides can be classified into the B-turn and y-turn, i.e., 
turns of four or three residues, respectively, using hard-sphere calculations and 
model peptides’°. These turns may or may not be stabilized by an intra-turn 
hydrogen bond; in f-turns between the C=O of residue i and the NH of residue 
i+3, and in y-turns between the C=O of residue i and the NH of residue i+2. The 
term "open" B or y turns could be used for situations in which no hydrogen bond 
exists, and the $, y angles are within 30° of the ideal. {-turns can be further 
classified as types |, Il, Ill, their mirror images I’, Il’, Ill’, Vla(cis) and Vib(cis). 
turns can be further classified as regular and inverse y turns’3,74, Table 1.1 lists 
these turn classifications for both B and y turns with the @, 1 angles for the 
central residue(s)’3. Among all these turns, type | is the most common one 
accounting for about 35% of the total occurrences, followed by type II and III with 
a 15% probability for each. The type Ill turn actually has repeating torsion 


angles and is identical with the 340-helix. For both Type Ill and Ill’, the ideal 9, y 
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values are so close to the values for Type | and I' that they could be 
distinguished only in the most highly refined protein structures. Type I', II' and III' 
are disfavored because of steric hindrance. Type II usually requires a Gly 
residue in position 2. Type VI has a cis-proline in position 3; Type Vla has 9, w 
values in the @ region of the Ramachandran plot for the proline, a "concave" 
orientation of the middle peptide and the proline ring relative to the overall curve 
of the turn and typically is hydrogen-bonded. Type Vib has 9, yw values in the B 
region of the Ramachandran plot for the proline, has a "convex" orientation of 
the middle peptide and the proline ring and is usually not hydrogen-bonded. 

The term "nascent helix" has been used to describe an ensemble of turn- 
like structures over several adjacent residues of a peptide®°, Nascent helix 
structures typically occur in peptide fragments derived from helical segments of 


protein. 


Helices 

Helical conformations have been demonstrated by a variety of techniques 
in several linear peptides®!, The formation of helices in peptides has also been 
implicated in their ability to induce membrane fusion and in the mechanism of 
action of signal peptides®’. 

The a-helix has 3.6 residues per turn, with a hydrogen bond between the 
CO of residue i and the NH of residue i + 4. The closed loop formed by one of 
these hydrogen bonds and intervening stretch of backbone contains 13 atoms 
(including hydrogen). In the usual nomenclature for describing the basic 
structure of polypeptide helices, the a-helix is known as the 3.643-helix. The 
rise per residue along the helix axis is 1.5 A. All a-helices are right-handed (for 


L-amino acids, the left-handed a-helix has a close approach between the 
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carbonyl oxygen and the (-carbon). With 3.6 residues per turn, side chains 
protrude from the a-helix at about every 100° in azimuth. The backbone 
conformational angles for a right-handed a-helix are approximately = -60°, wp = 
-60°, which is in a favorable local conformation with a steep energy minimum, 
even ignoring the hydrogen bonds. a-helices are certainly the most regular 
pieces of structure to be found in globular proteins, but even so they show 
significant imperfections, such as bends in the axis of a helix. 

The only other principle helical species besides the a-helix which occurs 
to any great extent is the 349-helix, with a three-residue repeat and a hydrogen 
bond to residue i+3 instead of i+4. Its backbone conformational angles are 
approximately @ = -60°, wy = -30°, within the same energy minimum as the a- 
helix. However, for a long periodic structure, the 34 Q-helix is considerably less 
favorable than the a-helix in both local conformational energy and in hydrogen 
bond configuration. Long 34 9-helices are very rare, but short pieces of 
approximate 34 9-helix occur fairly frequently. Two consecutive residues in 349 
conformation form a type Ill B turn, and three consecutive 349-residues forming 
two interlocked tight turns are also fairly commonly seen. One important location 
for a single turn 34 ¢-helix is at the C-terminal end of an a-helix. 

The generally regular, repeating conformation in the a-helix places all of 
the charge dipoles of the peptides pointing in the same direction along the helix 
axis (positive toward the N-terminal end). It has been shown that the overall 
effect is indeed a significant net dipole for the helix, in spite of shielding effects, 


The helix dipole may contribute to the binding of charged species to the protein. 


B-Sheets 


Few well-characterized examples of (-sheet formation by short linear 


peptides are found because sheets are generally formed by association. 
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Principally, a B sheet is made up of almost fully extended strands, with o, w 
angles which fall within the wide, shallow energy minima in the upper left 
quadrant of the Ramachandran plot (9, -180° to 0°, w, 0° to 180°). B strands can 
interact in either parallel or antiparallel orientation, and each of the two forms 
has a distinctive pattern of hydrogen bonding. The antiparallel sheet has 
hydrogen bonds perpendicular to the strands, narrowly spaced bond pairs 
alternate with widely spaced pairs. Looking from the N- to C- terminal direction 
along the strand, when the side chain points up, the narrow pair of H - bonds will 
point to the right (C=O to NH). A parallel sheet has evenly spaced hydrogen 
bonds which angle across between the strands. Within a 6 sheet, as within an 
a-helix, all possible backbone hydrogen bonds are found. In both parallel and 
antiparallel 8 sheets, the side chains of adjacent amino acids along each strand 
alternate above and below the sheet, while those on the same side of the sheet 


are quite close together. 


Spectroscopic Methods to Detect Secondary Structures in Small 
Peptides 


Most, if not all, short linear peptides are present in solution as an 
ensemble of rapidly interconverting conformations. In favorable cases, the 
population of one or more folded conformers in the ensemble is high enough for 
detection by spectroscopic methods. Several spectroscopic techniques have 
been used to elucidate the solution conformations of linear peptides, including 
circular dichroism (CD), vibrational spectroscopy (Raman, Fourier transform 
infrared (FTIR) ), and nuclear magnetic resonance (NMR). In this thesis, NMR 
and CD were used to study the solution conformation of several peptide 


fragments in methanol/water solvent. 
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NMR Spectroscopy 


Nuclear magnetic resonance (NMR) is the most useful method for peptide 
conformational analysis in solution. NMR yields information about chemical 
environments of individual nuclei (chemical shifts), geometric relationship 
between nuclei (coupling constants), distances between nuclei (nuclear 
Overhauser enhancements (NOEs) ), accessibility of amide protons (exchange 
kinetics and sensitivity of resonance positions and linewidths to temperature, 
solvent, or paramagnetic probes), and dynamics of nuclei (relaxation times). 
More importantly, this wealth of information is site specific, so that properties of 
individual residues can be studied9*. When applied appropriately, NMR provides 
an exceptionally powerful technique for determining the structure of the 
dominant conformers adopted by a peptide in solution. 

The data for conformational analysis can be collected using one-, two- or 
three-dimensional NMR _ experiments92-96, For small linear peptides without 
isotope labeling, the most commonly used experiments are homonuclear 
two-dimensional COSY, DQCOSY, TOCSY, NOESY and ROESY experiments, 
which will be discussed in detail. Figure 1.38 shows the pulse schemes for these 
experiments. 

A 2D NMR experiment generally includes four successive time periods, 
preparation, evolution, mixing and detection. Preparation, evolution and 
detection are mandatory, and most 2D NMR experiments include a mixing period 
or a mixing pulse. The preparation period usually consists of a delay time, during 
which thermal equilibrium or steady state is attained, followed by one or more 
radiofrequency (rf) pulses to create the desired coherence. During the evolution 
period the coherence of the system is frequency-labelled in a manner which 
depends on the length of the evolution period. The mixing period may include 


one or more rf pulses and delay intervals. During the mixing period, information 
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is transferred between spins by some physical process, so that connections 
between nuclei labelled in the evolution period and nuclei detected during 
detection period are established. During the detection period the system evolves 
further and resulting free induction decay (FID) is recorded. 

In the two-dimensional homonuclear correlated spectroscopy experiment 
(COSY)97, mixing is achieved by a 90° pulse separating the evolution and the 
detection periods98, The COSY experiment produces correlation maps that 
display connectivities due to scalar spin-spin coupling and thus provides 
information on the proximity of nuclei connected by the chemical bonds92. For 
amino acid residues in peptides and proteins, COSY experiments can usually 
provide correlation between protons within each residue and the coupling 
constants between the NH and a-protons. COSY is normally performed using 
phase cycling which gives quadrature detection and phase sensitive 
presentation in both dimensionsYY. 

In DQCOSY (double quantum filtered COSY) experiments!°, the first 90° 
pulse creates single quantum coherence which is subsequently frequency 
labeled during the evolution period ty, as in conventional COSY. DQCOSY 
differs from conventional COSY in the coherence transfer pathway taken after 
the evolution period. In COSY, single quantum coherence is transferred from 
one spin to another; in DQCOSY the coherence transfer is through a transient 
state of multiple quantum coherence. The second 90° pulse excites all orders of 
multiple quantum coherence and the third pulse converts multiple quantum 
coherence into detectable single quantum coherence. The filter order is 
determined solely by the phase cycle of the second pulse. DQCOSY removes 
much of the dispersive contribution to the diagonal of the COSY spectrum and 
greatly attenuates the amplitudes of the diagonal peaks, which in turn reduces 


the amount of tj noise and makes it easier to analyze cross-peaks near the 
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diagonal. In DQCOSY, singlet resonances in the spectrum are removed and this 
increases the efficiency of water suppression. In addition, multiplets in DQCOSY 
spectra often have unique fine structures which provide increased resolving 
power in crowded regions plus information on the relative signs of the active 
couplings, therefore allowing distinction between geminal and_ vicinal 
coupling!00, 101, 

TOCSY, total correlation spectroscopy, was first proposed by 
Braunschweiler and Ernst!92. This experiment has particular potential for 
protein NMR studies since it permits correlation of all protons within a scalar 
coupling network. Hence, a complete subspectrum can be obtained for every 
amino-acid, making resonance assignments considerably easier!9S, TOCSY is 
based on the homonuclear Hartmann-Hahn (HOHAHA) effect!94. Magnetization 
transfer via the Hartmann-Hahn effect occurs only when the difference in the 
absolute magnitudes of the effective rf fields experienced by two coupled spins is 
smaller than the scalar interaction between the two spins. The effective rf field is 
the vector sum of the resonance offset and the applied rf field. In general, two 
coupled spins have different chemical shifts. One way to make the difference 
between their effective rf fields smaller is to supply a strong rf irradiation through 
composite pulse decoupling schemes, such as MLEV17!°. In the pulse 
sequence shown in Figure 1.3, trim-pulses (1-2 ms) before and after the 
MLEV17 mixing pulses are used to give the pure phase for the diagonal. One 
possible artifact comes from transverse NOE effects. This effect is always 
positive and NOE cross-peaks are of opposite sign to the cross-peaks from 
Hartmann-Hahn transfer. Since NOE cross-peaks are usually weak and easy to 
identify, together with the fact that Hartmann-Hahn transfer is very efficient, 


transverse NOE effects are not a major problem!°%, 
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The nuclear Overhauser enhancement or nuclear Overhauser effect 
(NOE)196 is the fractional change in the magnetization of one spin as a 
consequence of perturbing the population of other spins. NOE effects are due to 
dipolar interactions (through-space) between different nuclei and are 
proportional to the inverse sixth power of the internuclear distance. In the 
NOESY107,108 (2D nuclear Overhauser enhancement) experiment, the mixing 
period consists of two 90° pulses separated by a mixing time Try, during which 
cross-relaxation involving longitudinal magnetizations occurs. The maximum 
distance sampled depends on the mixing time (a mixing time of 100 ms for a 
protein of molecular mass 5000 yields cross-peaks from pairs of protons closer 
than 3A). In an ideal situation, distances of 5A can be detected. The length of 
Tm is varied randomly by +10% to suppress scalar-coupling effects!1°9. The 
cross peaks in NOESY correlation maps provide evidence of dipole-dipole 
coupling between nuclear spins in close spatial proximity. 

The ROESY experiment measures cross-relaxation rates in the rotating 
frame. This experiment was originally proposed by Bothner-By et al. and was 
termed CAMELSPIN'!9, but was subsequently renamed ROESY, i.e. rotating- 
frame Overhauser effect spectroscopy!'!. In this experiment, a strong, on- 
resonance transverse radio frequency (rf) field is applied to the spins at the end 
of the evolution period ty. This causes the spin magnetization to become "spin 
locked" parallel to the rf magnetic field. The on-resonance transverse ff field is 
usually achieved by a series of a pulse of tip angle and a delay. Because of their 
different precession frequencies during ty, different spins have different amounts 
of magnetization aligned along the rf field. Net transfer of transverse 
magnetization can then take place between spatially proximal spins. As with 
NOESY, the ROESY experiment allows all pairs of spatially proximal spins to be 


detected simultaneously. 


aps 


A ee an 
vm do i, 


 «* 


Wy “eyo 10°. Wee actin yweumshevO 163/D 
) ‘al le bo = ij Fi apni? ignoiraa’ ‘edt e 


te Pal St 1 rr Ve : i? | 


) noimuged at pctidrredy me 


‘qgubste sioatie SOU eniga | Wo Fay SI 
bas wmioul iorst! nowveed feanae-cipaontl) anode yy rt 
aottsiat layed) aio .Weg 4 seievm Sl OF Mae wi 
ive ; eae ’ nev setvwt OS) aor,* 


‘Yond? Bac Pam eeet ceziag “OC mw, ™ tino 


rT Ty Oriya a r 


ep 7 Hak Dad : 
ys 


= - , _— 
rot ert GO? to ob Gnbdeh &) orth Grbods ert m6 zoneged Yelqnee - 
| () we ohLe 


win utuoelom 


i + 7 " a _ 
shay 1a ~ eeovieie Ieee ne iil AAG Rete 
- . 
: « ‘ Late ey ~~ > 
225s andqlouvslane) epee: CTS YO vimobNa belay ‘ay 
j : Se “rt «at - —" - r 
sieve > sbivow eniGy Yom? ay | P20 
7 i] 
=e * 
: ft 2 i 2enge salou neewhed, sTHQUOS 
ro _< _ 
> A * 
7 : - , se a 
va! wlaraier2cq tewtaar inomieqns VESOR iiss 
> 


‘bre im to v@ounitieS vd peetotig VIRAGD Baw Inemnedee att an a) 
; ee: 


tenn el YEIOR lewiens “uel? new hid Prarie = ernie 


Ww ..eods 6 Jne SNe Y Me 4 hy necand oft aie setts WwSURT ov 
3 orl te enige af of Setage Ole) (a @oneap ant cna serevensit onpar 
‘ou ; 
sue ” Moga oF t T Hy , ie 7) rae et.) 7 eeeur 3 ei ‘T yi boing ne oye py 


si uot th eeiaveria?. esnsnage+ne Gt “htan 3 aig a) h oot OF 
jer Ss seseged .yelob & >a igre & Mo deuy ee paige & yd be 
zmbome taswitit evel znice ‘etibe. Monit, 2onneupell 
amevanayt % releriuy, (olf -talgit a aw, “ ce oes 
(Rw 2h & lamieoig seas vosuned an | 


" 7 pola 
Oe Oberrigd lentixaiy ieee ta a al a ae ie + 


In ROESY experiments, transverse magnetization undergoes cross- 
relaxation and the rate of cross-relaxation therefore depends on spin-spin 
relaxation processes. In contrast, in NOESY experiments longitudinal 
magnetization undergoes cross-relaxation and the rate of cross-relaxation 
therefore depends on spin-lattice relaxation processes. Consequently, cross- 
relaxation in the rotating frame has a different dependence on molecular motion 
than cross-relaxation in the laboratory frame. In the rotating frame, the cross- 
relaxation rate is always positive. In contrast, in the laboratory frame the cross- 
relaxation rate changes from negative to positive with increasing molecular size. 
For medium size molecules, the cross-relaxation rate in the laboratory frame can 
be close to zero, resulting in unobservable NOEs. This means that the ROESY 
experiment is particularly attractive for moderately sized molecules like small 
linear peptides. For bigger molecules, cross-relaxation in the laboratory frame is 
an energy-conserving process and this leads to substantial spin diffusion. In 
contrast, for slow molecular motion cross-relaxation in the rotating frame is 
tempered by dipolar relaxation, therefore attenuating spin diffusion which means 
more accurate measurements of cross-relaxation rates and better descriptions of 
molecular structure. Finally, combined with NOESY, ROESY provides an 
attractive method to study molecular motion. Measuring cross-relaxation rates 
both in the rotating frame and in the laboratory frame may allow the nature of the 
molecular motion to be defined! 12. 

It is possible that some artifacts could appear in the ROESY 
experiment!!2, Among them are ones mediated by chemical exchange and the 
most important coherence transfer mechanism via J-coupling, i.e., COSY type 
transfer and homonuclear Hartmann-Hahn transfer. Multistep magnetization 
transfer is also possible. The cross-peaks resulting from COSY type transfer 


can usually be recognized by their anti-phase character. The cross-peaks 
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resulting from chemical exchange and Hartmann-Hahn transfer show a positive 
absorption, while the real NOE cross-peaks show negative absorptions. To 
reduce the Hartmann-Hahn transfer in the ROESY experiment, a small angle 
pulse (30°) for the transverse rf field and a moderate spin-lock field strength 
should be used. Hard 90° pulses immediately before and after the spin-lock are 
also used to reduce the resonance offset effects. 

A conformational analysis of peptides or proteins by NMR starts with 
sequence-specific resonance assignments9¢. The most successful and rigorous 
procedure for sequential assignments is that proposed by WiuUthrich and co- 
workers9¢, based on the observation of sequential NOESY or ROESY cross- 
peaks. The overall strategy can be divided into two parts; (1) identification of 
spin systems using mainly H-H spin-spin connectivities, which can be done by 
using COSY, DQCOSY and most efficiently by TOCSY experiments; and (2) 
sequential assignments of these spin systems based on the observed NOEs 
between adjacent residues. starting with unique residues and unique pairs of 
residues for which the NOEs give the highest probability of being sequential. 
The short distances that are important for sequence-specific assignments are 
distances between the @ proton of one residue and the NH proton of the next 
residue, the distance between the £B proton of one residue and the NH proton of 
the next residue, and the distance between sequential NH protons. Once 
sequence-specific resonance assignments have been obtained, NOEs can be 
attributed to specified locations in the sequence. Based on this specific NOE 
information, combined with further NMR measurements, such as chemical shifts, 
coupling constants, amide-proton exchange rates and temperature dependence 
coefficients of amide protons, the conformation of the molecule can be analyzed 


in detail. 
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Chemical shifts do not give conformational information directly. Indirectly, 
their values do have some implications for conformation analysis, such as the 
involvement in hydrogen bonds and closeness to an aromatic center, when the 
chemical shift values are quite different from their average values. In some 
cases, chemical shifts can serve to monitor conformational changes in the 
molecule when the environment changes. Recently, with the increase in 
chemical shift data for protein molecules, correlation between chemical shifts 
and secondary structures of c-helix and [B-sheets in proteins has been 
demonstrated!13,114, 

1H-1H homonuclear coupling constants can be measured from 1D or 2D 
NMR experiments. Among 2D experiments, COSY and DQCOSY are the most 
common ones. When the coupling constants are comparable to the linewidth, 
care must be taken since the antiphase cross peak separation does not reflect 
the coupling constant93. In this case, other experiments, such as ECOSY115 
and P.E.COSY1'1® or direct simulation of cross peaks!!1”? may be used. 
Homonuclear coupling constants can provide valuable information for the 
conformation by means of the well-known Karplus equation!!8 using the Karplus 
constants obtained from the globular protein BPTI (Basic pancreatic trypsin 
inhibitor)119, combined with NOE information. In principle, the backbone 
conformations of peptides are defined by the torsion angles 9, wy and w of each 
residue, which, in turn, can be derived from the Ha-NH coupling constants and 
the strength of the nuclear Overhauser enhancements. The torsion angles 9, 
describing the rotation about the N-Ca bond, can be estimated by using the 
Karplus equation directly. In general, as many as four different dihedral angles 
may correlate with a given coupling constant. In peptides and proteins, however, 
» angles are found to be in the range from -30° to -180°7S and, for coupling 


constants smaller than 6.0 Hz, they fall into the -50° to -90° range. In addition, 
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NH(i)-Hp(i) NOE information can often provide the necessary information to 
distinguish between 9-values around -80° and around -160°, both related to a 
7.0 Hz coupling constant. Assuming that only the three staggered side-chain 
rotamers are energetically favorable, and allowing a deviation of +/- 30° from the 
ideal x1 (x1 is the dihedral angle between C@(i) and CB (i) atoms) values of -60°, 
+60° and 180°, the distance range between NH(i) and any of the two B-protons 
(i) is found to be between 2.2 and 2.8 A for » = -80° and between 3.0 and 3.6A 
for » =-160°129. In the first case, medium to strong nuclear Overhauser effects 
are to expected, whereas in the second case only weak interactions should 
occur. The w angles cannot be derived from homonuclear coupling 
constants'@1, but they often can be estimated satisfactorily from the Ha(i)- 
NH(i+1) NOEs. Assuming trans peptide bonds (w = 180°), the distance between 
Ha(i) and NH(i+1) is minimal, and hence the NOE maximal, for wy = +120". 
Within deviations of +/- 30°, the change in distance is very small!2° and clearly 
below 2.5 A, which is generally considered as the upper limit for NOEs to be 
labeled as "strong"2*. Medium and, in particular, weak NOEs allow less 
definitive conclusions about the y angles, and should therefore be considered as 
approximations. The peptide bond angle w is, except for peptides containing 
proline, considered to be 180° (trans). The conformation of side-chains is 
analyzed in terms of side-chain rotamer populations! for which clearly resolved 
Ha-Hp and Ha-Hf' coupling constants are a prerequisite. 

The most important experimental structural information obtained from the 
NMR studies is the NOE between two spins from NOESY or ROESY 
experiments since distance information can be gained from NOEs. For a pair of 
spins, the buildup of crosspeak intensity for a short mixing time is approximately 
linear with a slope proportional to T>/r6 (T. is the overall molecular tumbling 


correlation time and r the distance between two nuclei). In principle, a more 
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quantitative estimate of the distance can be made by using the buildup curve22, 
crosspeak intensity vs. mixing time. Since the correlation time is not known in 
general, distances between spins may be calculated from ratios of crosspeak 
intensities, using one crosspeak for which the distance is known (i.e. geminal 
pair), provided the effective correlation time is the same for all pairs of spins. If 
spin diffusion occurs, or different parts of the molecule have different correlation 
times, more complicated procedures, such as complete relaxation matrix 
analysis, must be used to get correct distances !29. 

Temperature dependence coefficients of amide protons, Ad/K, can be 
determined from studies of the chemical shifts of amide protons at different 
temperatures. Smaller values of Ad/K usually indicate that the amide protons are 
involved in hydrogen bonds or are otherwise inaccessible to solvent molecules. 
For large molecules, exchange rates of amide protons can be measured from a 
series of spectra taken in a completely deuterated solvent. Slow exchange rates 
are an indication of hydrogen bond involvement and of less structure fluctuation. 
Relaxation measurements (both Ty and T9) can be used to detect the presence 
of other motion, such as chemical and conformational exchange. 

In principle, most of the NMR information can be transferred into 
geometric constraints over the conformation for the molecule studied. Based on 
these constraints, the molecular conformations can be generated by using 
distance geometry and molecular mechanics/dynamics calculations!@4.125, 
However, the general rules above can only be applied when the studied peptide 
possesses a relatively well-defined conformation or rigid conformation, which 
might be the case in some protein molecules. In the case of small linear 
peptides, the peptides possess an unlimited number of conformations and 
conformational interconversions not requiring peptide bond rotations occur in a 


time scale of nanoseconds. Since NMR is a method with a time scale of 
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seconds to hundredths of seconds for the usual NMR measurements, all 
measurable parameters for small linear peptides are therefore population- 
weighted averages of the conformational states available to the peptide, which 
precludes numerically precise interpretation of the data. Nevertheless, 
secondary structures can be identified by some characteristic distances, 
coupling constants or both for that specific secondary structure. Some important 
characteristic distances and coupling constants for various secondary structures 
are listed in Table 1.292. Wright!?6 summarized a list of accepted criteria, some 
of which are considered more definitive than others, that can be used to define 
the secondary structures of small linear peptides in solution. 

The criteria for turn formation include: (a) dyyn(i, i+ 1) NOE connectivities 
between residues 3 and 4 of the turn (type | turns also have a dyj(2,3) 
connectivity); (b) a dgn(2,4) NOE connectivity; (c) appropriate SJHNa coupling 
constants (not always seen); (d) a low temperature coefficient for the NH of 
residue 4 of the turn, indicating hydrogen bonding (not always seen). Such a 
hydrogen bond is not a requirement for turn formation, at least not in proteins!27; 
(e) characteristic CD or vibrational spectra. 

The criteria for helix formation include: (a) a series of relatively strong 
dNN(i, i+1) NOE connectivities, usually accompanied by a diminution in the 
magnitude of the dqn(i, i+1) NOE; (b) reduced SJHNa. coupling constants; (c) 
medium-range dqni(i, +3), dNN(i, +2), dapii, i+3), and dgpii, i+4) NOEs; (d) a 
pronounced negative ellipticity at 223 nm in the CD spectrum ( the wavelength of 
the minimum of the 200 to 208 nm band is also a good indicator of helical 
conformation; (e) characteristic vibrational frequency bands in Raman or FTIR 
spectra. 

The criteria for B-sheets are characteristic interstrand dgNii, j), ANNI, J) 


and daali, j) NOEs. 
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Circular Dichroism 

It is well known that molecules such as peptides and proteins containing 
asymmetric carbon atoms are optically active and exhibit the phenomenon of 
optical rotatory dispersion (ORD), which is the measurement of the wavelength 
dependence of the "dispersion" of optical rotation, and of the circular dichroism 
(CD) caused by different interactions of left-handed circularly polarized light 
(LHCPL) and right-handed circularly polarized light (RHCPL) with chiral 
molecules!*8, A solution of optically active molecules absorbs or refracts 
LHCPL and RHCPL to different extents. If €, and €R are the molar absorption 
coefficients for LHCPL and RHCPL, the molar circular dichroism is defined as Ae 
= €| -€R, which naturally gives rise to the observable property of ellipticity. If ny 
and nR are refractive indices for LHCPL and RHCPL, the circular birefringence 
or optical rotation is a measure of (n,_-nR), which gives rise to optical rotation. 
Because of the close relationship between absorbance and refraction, CD and 
ORD properties are interrelated by a Kronig-Kramers transform. However, 
because of the greater sensitivity of measurements of the circular dichroic 
absorbance than of the ellipticity, CD has replaced ORD for the conformational 
analysis of proteins and peptides!28,129, 

Circular Dichroism (CD) is a very sensitive technique. In practice, 
differences of as little as one part in 106 in absorbances are sufficient to 
generate readily observable effects in CD. Probably the most extensively used 
CD technique for the exploration of peptide and protein conformation in solution 
is the CD spectrum in the far-ultraviolet (UV) region (170 nm to 240 nm), where 
the backbone amide chromophore absorbs. Since the arrangement of peptide 
bond chromophores with respect to each other and to asymmetric fields in the 


molecule will vary with different conformations, far UV circular dichroism is 
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exquisitely sensitive to conformation. CD also has a very fast time scale (10°15 
sec.) and is able to detect an individual conformation for the peptides which 
undergoes fast exchange with the others in term of the time scale in NMR. The 
disadvantage for CD is that spectra of peptides and proteins are composed of 
broad overlapping bands arising from each amide chromophore of the molecule. 
Consequently, detailed interpretation of CD spectra for molecules larger than di- 
or tripeptides is difficult and information provided is not site-specific. 
Furthermore, several different conformations could yield similar CD spectra. 
Fortunately, the characteristic CD spectra of regular elements of 
secondary structure present in proteins and large polypeptides are well 
understood from experimental and theoretical work!28-131, An ideal a-helix 
usually shows a typical double minimum shape with minima at 222 and 208-210 
nm of approximately equal intensity, a maximum at 191-193 nm, which is about 
twice as intense as those negative bands, and a positive shoulder near 175 
nm'82, For a less than ideal a-helix, the intensities of the three major bands 
could be quite different from the ones of an ideal a-helix'S!. The 349 helix is 
predicted to have a spectrum similar to that of the a-helix. The length of the 
helix is also an important factor for the overall shape of the CD spectrum. It is 
predicted that a minimum of 2 turns (~ 10 residues) are necessary to provide an 
a-helix-like CD spectrum, while 50 residues provide an accurate model of an 
infinite helix'31, @-sheets are more diverse than helices; their CD spectra 
depend on the polarity (parallel versus antiparallel), the number of strands, and 
the number of residues per strand and their twists. A B-sheet could show a 
negative band above 200 nm, a positive one near 200 nm and another negative 
band below 170 nm133,134. Parallel and antiparallel B-sheets have been 
predicted to have qualitatively similar CD spectra'S8. Among the many variants 


of the ®-sheet, the positions of the bands can be shifted by several nanometers 
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and their magnitudes varied. The CD spectrum of the unordered form generally 
shows a strong negative band near 200 nm and a very weak band around 220 
nm, which can be either a positive band or a negative shoulder. The magnitude 
of the 200 nm band can vary with the molecules and the conditions135, The 
situation for the CD spectra of B-turns is even worse, because f-turns have more 
variants. Theoretical analysis predicts that type |, Il and III ®-turns, which 
account for about 80% of f-turns in proteins!96, should have similar CD 
spectra!s’. The most common CD spectrum for turns shows a weak negative 
band between 220-230 nm, a positive band between 200 and 210 nm and a 
strong negative band between 180 and 190 nm. However, CD spectra of some 
B-turns may be quite different. 

More importantly, the chiroptical properties of various conformations in a 
molecule are assumed to be additive. The contributions due to non-peptide 
chromophores below 250 nm are frequently neglected, This may be justifiable if 
aromatic groups and disulfide bonds account for less than 10% of the residues. 
The CD spectrum of a protein can be deconvoluted in terms of contributions 
from different components, such as a-helix, B-sheet, reverse turn, and so-called 
random coil, provided that the reference spectrum for each component can be 
evaluated. Reference spectra can be obtained from model compounds!29:133, 
or calculated from CD_ spectra of proteins of known secondary 
structure130,138,139_ With such approaches, CD data can give statistically 
significant estimates of many classes of secondary structure, if data are 
measured from 250 nm to 184 nm. If data extending only to 200 nm are used, 
the presence of a-helix is generally the only one that can be determined with 
confidence130.140, jn one case, correlation coefficients between X-ray structure 
and predicted structure for CD data measured to 178 nm are 0.97 for a-helix, 


0.76 for B-sheet, 0.49 for B-turns and 0.86 for other structures'41. 
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In principle, the same methods used to identify and deconvolute protein 
CD spectra can be used to study small linear peptides. A population of 
secondary structure will be mostly readily detected by CD in a small peptide 
because the ellipticity is measured per residue and would be diluted out in a 
longer peptides by the signals from the rest of peptide. However, care must be 
taken because of the dynamic properties of small peptides. In peptides, 
secondary structures are less ideal and more flexible, so that their CD spectra 
could be quite different from those of ideal secondary structures8!,82, 
Deconvolution could be very difficult, because of the absence of reference 
spectra. Especially, the reference spectra for fB-turns are difficult to obtain and 
have relatively low ellipticities compared to a-helix or B- structure, which means 
that decomposing a complex spectrum into various structural features including 


turns involves large errors. 
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(a). Schematic Representation of Type Il Collagen Structure 


N-Telopeptides | Triple Helix | C-Telopeptides 

a-4 a-1 

a-1 a-4 

a-1 a-1 
(b). Type! Collagen in Bovine and its Telopeptides 

Triple 
N-Telopeptides Helix C-Telopeptides 
a-1 pELSYGYDEKSTGISVP SGGYDLSFLPQPPQEKAHDGGRYY 
a-1 pELSYGYDEKSTGISVP SGGYDLSFLPQPPQEKAHDGGRYY 
a-2 pEFDAKGGGP SGGYER 
16 amino acids ~ 1000 24 amino acids 
a.a 
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(c) Type II Collagen in Human and its Telopeptides 


Triple 
N-Telopeptides Helix C-Telopeptides 
a-1 pEMAGGFDEKAGGAQLGVMQ GPGIDMSAFAGLGPREKGPDPLQYMRA 
a-1 pEMAGGFDEKAGGAQLGVMQ GPGIDMSAFAGLGPREKGPDPLQYMRA 
a-1 pEMAGGFDEKAGGAQLGVMQ GPGIDMSAFAGLGPREKGPDPLQYMRA 
19 amino acids ~ 1000 27 amino acids 
aa 


(d) Type Ill Collagen in Bovine and its Telopeptides 


Triple 
N-Telopeptides Helix C-Telopeptides 
a1 pEYEAYDVKSGVAGG GGGVASLGAGEKGPVGYGYEYR 
a1 pEYEAYDVKSGVAGG GGGVASLGAGEKGPVGYGYEYR 
a1 pEYEAYDVKSGVAGG GGGVASLGAGEKGPVGYGYEYR 
14 amino acids ~ 1000 22 amino acids 
a.a 


Figure 1.1 Schematic representations of a Type II collagen monomer structure 
and telopeptide sequences in Type |, Type Il and Type Ill 


collagens. 
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Figure 1.2 Standard nomenclature for the atoms and the torsion angles along 


a polypeptide chain. 
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Figure 1.3 Pulse sequences for some 2D NMR experiments. 
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Table 1.1. Dihedral Angles of Hydrogen Bonded £ and y Turns: 


y 
Type | -60 -30 -90 0 
Type I’ 60 30 90 0 
Type Il -60 120 80 0 
Type II’ 60 -120 -80 0 
Type III -60 -30 -60 =30 
Type III 60 30 60 30 

Type Vla (cis) -60 120 -90 0) 

Type VIb(cis) -120 120 -60 0 

y-turn 70 to 85 -60 to -70 

Inverse y-turn -70 to -85 60 to 70 
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Table 1.2. Characteristic Short Distances (A) and Coupling Constants (Hz) in 
Some Regular Secondary Structures 


————e—e—e—eses SSS 


a-helix 310 B-sheet Type 12 Type II@ 
dan 3.5 3.4 21D 3.4 212 
ye 3.2 
dan (i,i+2) 3.8 3.6 chic 
dqN(i,i+3) 3.4 3.3 
dNN 2.8 2.6 2.6 
2.4 2.4 
dNN(i,i+2) 3.8 
dgn° 25-41 29-44 3.247 29-44 3.6-4.6 


3.6-4.6 3.6-4.6 
do p(iit3)2 25-44 — 3.1-5.1 


Coupling 4 4 4c 45d 
Constants 
a) For the turns, the first two numbers apply to the distance between 


residues 2 and 3, the second to the distances between residues 3 and 4. 


b) The ranges given correspond to the distances adopted by a B-methine 
proton if %4 is varied between -180 and 180°. 


C) For residue 2 in the turn. 


d) For residues 2 and 3, respectively. 
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CHAPTER 2 


Solution Conformation of the Type | Collagen Alpha-2 
Chain Telopeptides Studied by 1H and 13C NMR 
Spectroscopy” 


Introduction 

The collagens are a family of high molecular weight proteins produced 
and secreted by connective tissue cells such as fibroblasts, chondrocytes and 
osteoblasts, to perform various functions in the intercellular space. The most 
abundant form of collagen, genetic Type |, is found as relatively thick fibrils, 
approximately 20 to 200 nm or more in diameter, depending on tissue and 
age!:2. When organized into fibres and fibre bundles, these give mechanical 
strength to tissues such as dermis, tendon, bone and dentine. Fibrils closely 
resembling those seen in vivo precipitate from acidic solutions to pure 
monomeric Type | collagen on neutralization and warming?4. Therefore the 
information necessary to direct this complex macromolecular assembly process 
is present in the sequence and higher orders of structure of the collagen 
monomer. Through most of its length the collagen molecule is a triple-helix of 
three polypeptide chains (two a-1 and one a-2 chain). The triple-helix is flanked 
by short non-triple-helical sequences (the "telopeptides'), 16 and 24 residues 
long in the case of the N- and C-terminal ends, respectively, of the a-1 chain, 
and 9 and 6 residues long for the N- and C-terminal ends of the a-2 chain of 


bovine type | collagenS. In spite of their relatively small size, the telopeptides 


A version of this chapter has been published. Liu, X., Scott, P. G., Otter, A. & Kotovych, cr 


(1990), J. Biomol. Struct. Dyn. 8, 063-080. 
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seem to play critical roles in fibril assembly, since their removal by proteolysis 
seriously impairs this process®.’”, These observations imply the existence of a 
set of specific, presumably conformation-dependent, non-covalent intermolecular 
associations between telopeptides on one molecule and the triple-helix of one or 
more adjacent collagen monomers within the growing fibril. The solution 
conformations of the telopeptides are therefore of considerable interest. We 
have recently described 2D-NMR studies on the a-1 chain N-8 and C- 
telopeptides9 which provide further insight into their roles in collagen 
fibrillogenesis. We present here the results of similar studies on both N- and C- 


telopeptides of the a-2 chain. 


Materials and Methods 

In the present study two telopeptides were studied, namely the 9-amino 
acid N-terminal a-2 chain telopeptide pGlu'-Phe?-Asp3-Ala4-Lys5-Gly®-Gly ’- 
Gly8-Pro9-NH2 (pE1-F2-D3-A4-K5-G®-G7-G8-P9-NH2) and the C-terminal a-2 
chain telopeptide N-acetyl-Ser1-Gly2-Gly3-Tyr4-GluS-Phe® (NAC-S1-G?-G3-Y4- 
E5-F6) Their structures are shown in Figure 2.1. Both peptides were 
synthesized commercially. The nonamer (IAF, Montreal, Quebec) was judged to 
be pure by HPLC and NMR. The hexamer (API, Edmonton, Alberta) was found 
to contain inorganic impurities which were removed by HPLC as described 
previously. 

The NMR samples were prepared by dissolving the purified solid material 
in ca. 0.2 mL water (mixture of D2O and H20). Then the pH was adjusted by 
adding small aliquots of diluted NaOH or HCl, followed by adding the rest of the 
solvent, which was 0.3 mL CD3OH for most of the samples. To change the 
solvent at the same pH, the sample was freeze-dried and redissolved in the new 


solvent. Before the ROESY experiment the sample was degassed with argon. 
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The concentration of the nonamer was approximately 13 mM and of the 
hexamer 10 mM. The 1H spectra were also run on the solutions which were 
diluted 10 times from the working samples under similar conditions, to confirm 
that there was no aggregation or other structural changes with concentration. 
The pH dependence of the 'SC resonances was carried out on a 28 mM sample. 
At pH 5.1 there was no dependence of the !SC shifts on sample concentration. 

All NMR experiments were carried out on Bruker AM400 and AM300 NMR 
spectrometers at ambient temperature (295K), except for the temperature study. 
The temperature dependence of the amide protons was calculated by comparing 
spectra recorded at 3 temperatures in the range from 273K to 293K with a 
maximum deviation of + 0.3. Data collection was controlled by an Aspect 3000 
computer using 1987 DISNMR software. For data processing the same 
computer and software were used, except that the Fourier transformations were 
carried out on the ASPECT 1000 computer with an array processor. The 
chemical shifts were measured relative to CD30H (Merck, Sharp & Dome, MD- 
67, 99.2% D) at 3.30 ppm (1H) and 49.0 ppm ('8C), respectively. For the 
aqueous sample, a trace amount of CD30H was added as the chemical shift 
reference. 

One-dimensional proton spectra were recorded with a sweep width of 20 
ppm and 16K (zero-filled to 32K) data points in both water and water-methanol 
solvents. The presaturation of the solvent peak(s) was performed during a 3s 
relaxation delay at the lowest possible decoupler power to avoid saturation 
transfer and possible wipe-out of the exchangeable protons. For the sample in 
the CD30H/H2O solutions, the frequency of the decoupler was changed every 
0.3 s between two decoupler positions, one at the OH-signal and another at the 
signal of the undeuterated fraction of CD3OH at 3.30 ppm. To account for the 


big difference in intensity between the two solvent resonances, the former signal 
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was Saturated twice as long as the methyl group. The same basic procedure 
was followed for the aqueous sample, except that the decoupler was switched 
between three positions, namely on the water resonance and + 5 Hz to either 
side of it. Finally, the resulting FID's were resolution enhanced by means of a 
Gaussian multiplication with a line broadening factor typically of -3 Hz. 

The 1'8C spectra were recorded in the mixed solvent using a DEPT 
(distortionless enhancement by polarization transfer) pulse sequence!® for the 
protonated carbons. DEPT spectra were recorded with a sweep width of 160 
ppm and 32K data points. The total relaxation delay was 1.8 s (including the 
acquisition time) and the total evolution delay for the polarization transfer 8.3 ms. 
The final proton detection pulse was 135°, creating positive signals for CH and 
CH3 groups and negative signals for CH2. Proton broadband decoupling was 
performed during the acquisition by means of a WALTZ"! composite pulse 
technique. The FID was multiplied with an exponential function (line broadening 
factor 1.5) prior to Fourier transformation. 

The parameters for the various two-dimensional NMR techniques are 
summarized in Table 2.1. The COSY12, ROESY1%17 and the '8C-1H correlation 
experiment using DEPT for polarization transfer'8 were all carried out in the 
phase-sensitive mode using time-proportional phase increments!9. Solvent 
suppression was achieved in the COSY and ROESY experiments as described 
for the one-dimensional spectra. In addition, continuous irradiation was applied 
to the water signal at all times except during acquisition®@°, The phase constants 
were determined by using the second FID which was multiplied by the weighting 
function, Fourier transformed, and then phased. The phase constants thus 
determined were applied to all spectra in the tg domain. No phase correction 
was done in t; except for the ROESY experiments where a basic 90° phase 


correction was required. The resulting two-dimensional matrix was then usually 
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slightly rephased in order to obtain the best possible phase in the area of 
interest. The ROESY spectra showed some baseline distortions around intense 
diagonal peaks. Therefore, the spectra were subjected to a baseline correction 
routine*!, which fits a polynomial function of fifth degree to the baseline (ABS 
baseline correction), and the problem was eliminated almost completely in both 


dimensions. 


Results 
Proton NMR 

The spectral analysis of the nonamer and the hexamer (Figure 2.1) is 
based on the results of the phase-sensitive COSY'19 and ROESY13.14 
experiments. The following discussion will detail the data for the nonamer but 
similar procedures were followed for the hexamer. 

First of all, a pH titration was carried out in aqueous solutions (80% 
H20/20% De2O) (Figure 2.2). It is clear that at a pH of 5.06 or higher the NH 
proton of D3 has undergone a large shift because its f carboxyl group is now 
deprotonated. The solvent was then changed to 60% CD30H, 20% HzO and 
20% D20. This is a structure-inducing solvent mixture that we have used in the 
study of other peptides®.9, The contour plot of the COSY experiment in this 
solvent mixture is shown in Figure 2.3 (fingerprint region). The COSY 
experiment, combined with the ROESY (Figures 2.4a and 2.4b) experiment, 
allowed the complete assignment of all the protons of the nonamer. The 
spectrum was recorded with the transmitter approximately in the middle of the 
spectral range (ca. 5 ppm). H,(i)-NH(i+1) ROEs (Figure 2.4a) were obtained, 
allowing the sequential assignments. The biggest difficulty was in distinguishing 
the G6 and G7 proton resonances but this was resolved using the ROESY 


experiment. The results of the proton data are shown in Table 2.2, both at pH 
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5.06 as well as at a pH of 3.00. Since ROE cross-peaks show a non-negligible 
offset dependence®', they often cannot be observed at the edge of a spectrum, 
irrespective of the internuclear distances involved. This is especially true for NH 
to NH interactions which are extremely important for the determination of 
secondary structures in peptides and proteins@*. As such interactions were not 
observed with the transmitter at ca. 5 ppm, a ROESY experiment was performed 
with the transmitter set in the middle of the NH region. NH-NH ROE's, if present, 
would not have been observed because the DS-A4-K5-G® NH-resonances are 
too close together and the cross peaks cannot be identified. The spin-lock 
conditions were achieved by a series of hard pulses of small flip angle@9 and the 
spin locking time was set to 100 ms at an average field strength of 4.0 kHz as 
was found suitable for peptides of similar size in previous studies. The spectra 
were basically free of any cross-peaks due to magnetization transfer between 
scalar-coupled spins (Hartmann-Hahn magnetization transfer) 14: 23, 

The results of the spectral analysis of the proton data (Table 2.2) also 
includes the results of a temperature study of the exchangeable protons. The 
nonamer was recorded at four temperatures and the temperature coefficients 
were calculated. The temperature coefficient calculated for NH (G®) of 3.2 ppb/K 
at a pH of 5.06 is indicative of shielding from the solvent, probably because of 
hydrogen bonding. When the pH is lowered to 3.0, unfortunately there is an 
overlap of the NH protons for G6 and G’ and a reliable temperature coefficient 


cannot be determined. 


13C NMR 


The 18C spectrum of the nonamer was first recorded for the protonated 


carbons by means of a DEPT12 sequence applied to the 28.0 mM and 13.8 mM 


samples in water/methanol solutions. There was no concentration dependence 


al 


nidipayentorn 6 wors aleda-c20 AOR -eprl 00.5 tog 5 ie 2s Te 


x oF 
mutineda 6 loth ayitite beviesd0 40 INANS? CBhe yar? wy 
i vw? eur yteisages eicutl .ovdverressneiin ealsameaey 


~ 


, wolmiinetaa aff Ww? Were Vinmetxs 995 ‘iiriw 
4 


on aree-aribiiastein( fisue. cA S-chieith brie eabiiqag aewatoute rt 
tarmiotied an ao cwvetp S07 : Mae <> 62 Tew yrrretase IS eet) eiiw 
Meso FT FSOrM MA OK ‘4i4 SF TO atbibimn crt! “ise Jefvnenst 
oot (Gea-hv41 az) WPA of a2un9e8 bewser'é naed aver sont te 
HoGtrige of watities) ed Jonas” aipsd seo ert ine lerieger: 


ine CSuipcs ad lipme to asta bigd lo ehene ye vavorroe Slew 


as sHin 4 & 70 Ain e Dist os tive 1S 16. om why ra] igs 2hw om g 

orl 2a volver oi evic lehwiietc 2etiedg 1) exishues 

wel sie Juedisopen ot aur clasa<c2zo Wie sent | 

2 = (ielerey nonesieangam Mish-r srtvel) aruqe 

ic (© shtaT) Oleh hioja “SAl 1b clabtede isnaade eA te ehueer 

<!Y onolow oidelerttncdonde eriPto Yhute ciffebnfaihe iy Shuseye 

ywiniiveos SiueIOTIO GOF GMs Shiite ota Wot Ho Seni 291 eaw 
Niky $6 22) Hot tof Reneiialspeirigpiiedo 4 hoy HAP SET > ots 

to eeuecdd Yidedoig insvog Shai oaitenta ovina a Bt 


ne i overt! ViewOoRoti (ie oh naewor etic enh ment gn 


“theisitecs suteegee! aetere bas O bas 8D 10f enatog HA € 


Batenoien ot xt bebiciar i ae Mamaton A te ong 
ihn @.Er bre kien 0.86 sctrut b 


of the 1SC chemical shifts, within 0.1 ppm. Hence the resulting spectra are 
depicted in Figures 2.5a and 2.5b, as a function of pH at 28.0 mM. The doubling 
of the 'SC resonances is due to the fact that the backbone 138C resonances see 
both NH and ND substitution and hence a two-bond isotope effect24 due to the 
mixed solvent CD30H/D20/H20 (60/20/20). The magnitude of this isotope effect 
averages 0.11 ppm. The sample was then subjected to a 13C/1H_ two- 
dimensional phase-sensitive'8 correlation experiment. As can be seen in Figure 
2.6 (a and b), all resonances can be assigned and the results are summarized in 
Table 2.3. 

Because of the presence of two charged side-chain groups Asp3(D3) and 
Lys°(K°), a charge-charge interaction has to be considered. As can be seen 
from Figure 2.5, the D,° carbon undergoes a 0.91 ppm low-field shift due to 
deprotonation of its carboxyl group on changing the pH from 3.00 to 5.1. Over 
the same pH interval, Dg? undergoes a shift of +3.38 ppm. On the other hand, 
from pH 5.1 to pH 12.0, K,° shifts by +0.62 ppm, Ks° by +4.05 ppm and K,° by 
+1.41 ppm. These low-field shifts arise first from the deprotonation of the DS 
carboxyl group, followed by the deprotonation of the NH3+ of K®. Hence, based 
on these data and in the absence of nuclear Overhauser effects between the two 
side-chains under consideration, it was difficult to prove or disprove the presence 
of a salt-bridge at pH values below the pKa of NH3+*(K°) and above the pKa of 
COO-(DS) (expected to be about 10.5 and 4.5, respectively). 

The results for the hexamer will now be considered. The proton 
assignments were determined based on the COSY experiment, combined with 
the ROESY. The results, in CD30H/D20/H20 (60/20/20 volume percent) are 
presented in Table 2.4. The temperature dependence of the NH protons is also 
included. In the absence of a small NH temperature dependence, or small JNH- 


CHa coupling constants or long-range ROE's, it was concluded that the hexamer 


oy 


wi 


| | 
a e we. vy © 4 a ' aie w i 
wf kh 


e “ook. af entong iA arts to per snares: eS eT 


m 


1s witane igetebadtann ace ima in ata tom 
Weiwot oT or 0.8 38 tq oO fC Janu) 6 66 ad-lt wt oC. & 
> seorminees: D5! eqodiosd oft tSsAl Mebee of eup cress 


i og 6 ry Pi b Og 4) 2 Shen ofs Folhuritat naiene 


OF aes ‘7 # us 


‘ P a 7 
|. : d < alka bith be un ta Pl yj ry "e 
ae gantae) air io ehulncem 2 §. AOS) Ore WOE 2 a 


; + ~~ 


- _ 


at TOC. of berasidlie nett gawd. ahumee st > age ee 


7 


swell ni neces ed neo sf, tnaciheqre attends’? levilages-oeang 


' § —_— Petr ee LT ei } 
+ DACNeTiwie fie ¢ (; ori Cos DernciRmes, eons fone. ot) He sd - 
£8 olde 
4 


teri (3 : LOTR sghla bagiahd own sania ‘Gg aril to 6e5098) 


‘ - ei. 

nacn Sc | A bmehianos. ed OF ger nce sini si itiospisns Ss PAPeY 
P ee 

toh ite bish wo! mad (2.0 6 @sor'saew | Oa emt 25 awelt i ol 


4) t.aw mon Ha.sm STDTIETO NC CBee hijad eff 10 ne 


baun varto att 9D Bo.5+ toMirle 6 esomebrt "oS IRV Peaks on 


1 °4 Bite (jg ony 1 Sen rie £0 .0+ nes PR ty) AN Ea - -” 
Gj alt to polisneimaden srt mot Sit -saie etic tie fhe vant oe a 

> 
based .eoriaht N to tela)! act lo rratwendiaiaesh eal ne bowollot Wwe 


Owe ort rs iad sioalit: Peles ee) ) shu io Sande he, a ee 
79 

AGNFEO1G & lie cpt oe i Suerte Pit sh PS ry Sey it hy vr) ae} 

to pd ort evads bee CA giay! % vig or: wale eeulaw Hepie 


o — a? 
(ylievtosgesy €4 bos C0! juodbed meals yee) 


2 : 
noo ertT, bayebienos or won i TaMENeeh ari wi 
rtiw bormdmpo Inemhede YEOD srt iho ) east seiahanti 


pis (ineneG einulay OS's ta) re eect comin ot 


ol 


ys Mare 90 cia aialieadl 
ig ious i NON 


ad 
4 


exists in an extended or random conformation. For completeness, the 183C NMR 


data were also obtained. These are presented in Table 2.5. 


Discussion 

In principle, the NMR data (NH-H, coupling constants, ROE distance 
constraints, amide temperature coefficients) provide the necessary information 
to estimate the torsion angles 9, wy and w@5 of the peptide backbone. Due to 
conformational averaging, which is always present in the peptides under 
consideration, the measured effects are the weighted average over two or more 
conformations making a numerically precise interpretation of, for example, the 
coupling constants impossible. Nevertheless, certain characteristics of the NMR 
parameters are diagnostic for the presence of secondary structure? namely, 
(i) small or large NH-H, coupling constants of one or more residues, 
(ii) NH(i)-NH(i+1) nuclear Overhauser effect(s) due to short interproton 
distance(s), (iii) reduced temperature coefficient of one or more of the amide 
protons, and (iv) a low-field shift of some of the amide protons2¢. This low-field 
shift together with a reduced NH temperature coefficient is clearly indicative of 
the formation of a hydrogen bond, whereas the coupling constants and the 
ROEs reflect altered torsion angles » and y. Depending on the nature of the 
non-extended secondary structure, and to what extent it coexists with different 
types of extended conformers, the magnitude of the effects might be small and 


therefore easily overlooked. 


Conformation of the nonamer (N-telopeptide) in methanol/water 


A review of the 1H NMR data summarized in Table 2.2 reveals that this 
peptide is a representative of short linear peptides with some secondary 


structure. Most of the coupling constants and temperature coefficients as well as 
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the ROEs (Figure 2.7) are indicative of an extended conformation. However, the 
relatively small NH-Hq coupling constant measured in A4 together with a 
reduced NH temperature coefficient of G® suggest that the peptide is either not 
completely extended or that a small proportion of the molecules exist in a folded 
conformation. Moreover, the low-field shift of NH(A‘) strongly suggests that 
C=O(DS) takes part in a hydrogen bond. The most probable explanation of 
these observations would appear to be that the four residues D3-A4-K5-G§ of the 
nonamer form a Type | f-turn, stabilized by a C=O(D3)-NH(G®) hydrogen bond. 
The remaining residues are in an extended conformation. The magnitude of the 
effects discussed above and the observation of ROEs which are only consistent 
with an extended conformation, for example H,(i)-NH(i+1) and the 
complementary H,,(i)-Hsa:(i+1) throughout the peptide chain, suggest that only a 
minor fraction of the molecules adopts the f-turn conformation whereas the 
majority is extended throughout. It is not possible to assess the relative 
populations of the molecules in the two conformations in a numerically precise 
way, but circular dichroism spectra (P.G. Scott, unpublished data) suggest that 
the proportion of molecules with defined secondary structure is very low - no 
more than 5-10% of the total. In theory, the NH-NH ROE cross peaks could be 
used to distinguish between a Type | and a Type Il f-turn. As these could not be 
observed, we postulate a Type | #-turn because it is two to three times more 
common than the Type II, while the mirror images (Types !' and II’) are very 
rare26, In addition, protein secondary structure predictions suggest a Type | turn 
for D3-A4-K5-G6 26, 

As can be expected in view of the G8-P9 subunit, the peptide bond angles 
w are not uniformly trans (w = 180°). Evidence for cis/trans isomerism@7,28 


about the G8-P9 peptide bond is found in the 'H and 'SC spectra. The cis- 
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isomer is present to the extent of about 10%. The chemical shifts related to the 
minor isomer are included in Tables 2.2 and 2.3. 

Measurements were also carried out at low pH to see if the proportion of 
Type | B-turn changes. The proton data at low pH (3.00) (Table 2.2) prove to be 
inconclusive. Because of the overlap of the NH protons (G® and G7’), a reliable 
temperature dependence of these individual protons could not be measured. As 
well, NH(i)-NH(i+1) ROE's could not be detected at pH 3.00. Consequently, 
conclusions about the extent of secondary structure cannot be drawn. 

Calculations using the ECEPP/229 (data not shown) indicate that an 
extended conformation for DS-A4-K5-G®, stabilized by a salt-bridge interaction 
between the ionized COO: side chain of D? and the protonated NH3* of KS, 
would be of lower energy than the Type | f#-turn structure. Some physical 
evidence in support of the existence of a salt bridge comes from the observation 
of the relatively high pH dependence of the chemical shift of the NH proton of A4 


- the intervening residue (Table 2.2). 


Conformation of the hexamer (C-telopeptide) in CD3OQH/water 


The proton data in Table 2.4 show neither small JNH-cHa coupling 
constants nor very low NH temperature coefficients. Consequently we conclude 


that this peptide is extended throughout. 


Implications for collagen telopeptide-triple helix interactions 


In spite of their relatively small size compared to the telopeptides of the a- 
1 chain, the a-2 chain telopeptides may nevertheless contribute to collagen 
fibrilogenesis. Linear growth of fibrils is driven by interactions between 
molecules displaced by 4D (936 helical residues) and lateral growth by 1D (234 


helical residues) interactions. Experiments on proteolytically-modified collagen 
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strongly suggest that the N-telopeptides are primarily involved in linear growth 
and the C-telopeptides in lateral growth®2:31, |t can be see in Figure 2.8 that 
there is a greater potential for both charge-charge and hydrophobic interactions 
between the a-2 chain N-telopeptide and the adjacent segment of triple-helix 
displaced by 4D, rather than 1D. The converse is true for the C-telopeptide. In 
both cases an extended conformation would seem to result in better positioning 
of the telopeptide side-chains for interaction than would a folded structure 
involving one or more f-turns. Nevertheless, such structures have been 
predicted by others from the amino acid sequence%’2. The physical evidence 
from the NMR studies described here for an extended conformation of the a-2 
chain telopeptides does not support the predictions of probability-based 
algorithms but is in accord with the above observations on potential telopeptide- 


helix interactions. 
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Table 2.1. Summary of Experimental Parameters Used in the Two- 


Dimensional NMR Experiments 


a EES EE eee 


Parameters 

Sweep width in F2 (Hz) 

Sweep width in F1 (Hz) 

Matrix size (F1 x F2) before 
zero-filling 

Matrix size (F1 x F2) after 
zero-filling 

Evolution time 
Initial value (us) 
Increment (us) 

Number of scans (dummy 
scans) 

Acquisition time (s) 

Relaxation delay (s) 

Other delays (ms) 

Window functions for 2D 
Pi Gite) 

Shifts of window function in 


fractions of n(F1/F2) 


COSYPH 


3600 
1800 
912 x 4K 


1K x 8K 


60 


ROESYPH 


3600 
1800 
912 x 1K 


IKhoe2k 


XHDEPTPH 
5000 
1400 

64 x 4K 


256 x 4K 


0.41 
1.4 
4.2¢ 
EEG 


3.0/1 52 


letherhiegns To esis 


i 


T wi th ers oir es 
| cients? Pi, tenesertert 
ee. : 
get ‘Aas Ce 2 
fut OORT (si faa riibiw © —* 7 
nih x88 are es 
yril-ores 
Sh » “ wre t ro. (29 x FR eke te 
| pniii-ores. . 
it notional 
| (ey auley Iatfint = 
} | ee) inenmnott 7 
(! arene) mee toe , 
“2.6 ve 4 
COGit 6 
EY big ‘G8° sot 
°% 110.2 é ahs i -froltont wobrw 194 
a rk 


- 


For abbreviation of techniques, see text. 

Spin locking time at an average field strength of 4.0 KHz. 
(25136 - 14)"! 

S stands for a sine bell, E for an exponential function. 


Line broading factors in F1 and F2. 
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Table 2.2. Proton Chemical Shifts, Coupling Constants and NH 
Temperature Dependence of pE-F-D-A-K-G-G-G-P-NHo2 2 


NH Oo 6 y Others J AT 
(Hz) (ppb/K) 


pE1 7a86) OF 14018 OPH1072,2:3810:213)2126 
F2 8.23 4.67 2.93,3.19 8.6 6.9 
(8.28) (7.0) (8.1) 
D3 8.34 4.58 2.67 8.6 5.2 
(8.43) (7.7) (6.5) 
A4 Baayen 4:21 1.41 5.7 ( 
(8.17) (5.5) (6.4)¢ 
KS 8.31 4.27 1.86 1.45 5:1.68 7.4 5.3 
€:2.97 
(8.17) (7.8) (6.4)¢ 
el 8.23 3.92 6.0/6.0 3.2 
(8.26) (5.5/5.5) (5.3) 4 
G7 8.31 3.96 6.0/6.0 5.9 
(8.26) (5.5/5.5) (5.3) 4 
Gee 8.10 4.07 6.0/6.0 5.2 
(8.14) (5.8/5.8) (6.2) 
p9b AigGee 062.220 2.00 8:3.54 
5:3.64 
NH2? 6.97, 7.68 


(6,99,7.70) 


CN LS 
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The nonamer was studied at 13.8 mM in CD30H/H20/D20 (60/20/20). 
The pH was 5.06, whereas the data in brackets are at pH 3.0. The shifts 
are in ppm with respect to undeuterated CD3 of CD30H at 3.30 ppm 

The chemical shifts for the cis-isomer are: G8NH, 8.13; G8CH,, 3.72; 
P9CHg; 4.48; NHo2 7.88, in ppm. 

Two sets of data are not distinguishable because of spectral overlap. 


Two sets of data are not distinguishable because of spectral overlap. 
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Table 2.3. 13C Chemical Shifts of pE-F-D-A-K-G-G-G-P-NH> as a Function 


of pH? 
pH ot B Y y € C 
pE! 3.00 Stas) 26.21 29.94 
Shi! Seely, pda wiy AAS) AS 
Te WOM 1 3 | 26:17 29:93 
Fe 3.00 56.07 37.82 130 OO 29 50 ie) 2 7205 
aye i Ss}ts fe) 38.02 130.04 129.48 127.94 
12.00 55.84 37.93 130.05 127.94 128.01 
D3 3.00 51.08 36.43 
5.13 51,99 39:81 
12. 00%S52°35 39.74 
A4 3.00 51.086 17°38 
5.13 51.44 life Lo 
12:00 851732 1738 
KS 3.00 54.68 31223 *23110. 27.43 40.33 
ay ES) 54.98 30. Sfmepoosul epee oF 40.21 
12,008 55.20 CE. ARTETA SS, 41.62 
G6 3.00 43.52 
5aVS 43.74 
12.00 43.75 
G? 3.00 43.28 
S13 43.32 
12.00 43.34 
G8 3.00  42.62(42.40)° 
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arnt oe 
eo Ni cE. té 
sv MELE 
PE\f. Tres. VS. RE te 
N =7 
Ca fh PE-TE CY ES eocr 
5 . 


Pg 


P9(cis) 


5.13 42.86(42.37) 
12.00 42.68(42.38) 


3.00 61.37 30.56 25.30 47.69 
Dale 61.42 30.54 25.29 47.72 
12.00 61.42 30.55 25.30 47.77 
3.00 60.91 33.03 23.00 
o.16 60.92 33.00 23.00 
12:0038-60:95 33.00 23.00 


The samples were 28.0 mM in CD30H/D20/H20 (60/20/20). Experiments 
carried out at 13.8 mM indicate that the shifts are the same, within 
experimental error (0.1 ppm). Because the backbone resonances 
experience an isotope effect due to the presence of both NH and ND, the 
chemical shifts of the more intense peaks (NH substitution two bonds 
away) are presented. 

The two signals cannot be distinguished because of spectral overlap. 


The data in brackets are for the cis isomer. 
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Table 2.4. Proton Chemical Shifts, Coupling Constants and NH 
Temperature Dependence of NAc-S-G-G-Y-E-F@ 


NH a 8 y J AT 
(Hz) (ppb/K 


g1P 8.34 4.39 3.79,3.88 7.0 8.4 
Ge 8.59 3.92 6.5/6.5 7.5 
G3 8.26 3.84 6.0/6.0 5.1 
y4 7.94 4.52 2.80,2.99 8.0 6.4 
ES 8.39 4.22 1,29,1.82 4.03 7.8 6.3 


FS 7.47 4.40 2.94,3.16 8.0 4.4 


(a) |The sample was 10.0 mM, pH 5.5 in CD30H/D20/H20 (60/20/20). 
(b) |= The chemical shift of N-acetyl is 2.05 ppm. 
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Table 2.5. 13C Chemical Shifts of NAc-S-G-G-Y-E-F@ 


Ol 6 y 5 € C 
Sie 57.30 62.51 
G2 43.80 
G3 43.28 
v4 56.20 37.26 ah eer 116.31 
ES 55.12 28.94 34.68 
FS 57.04 38.75 130.42 129.25 127.46 


(a) The solvent is CD30H/D20/H20 (60/18/22 volume %). The sample 
concentration was ~ 10 mM, pH 5.3. Because the backbone resonances 
experience an isotope effect due to the presence of both NH and ND, the 
chemical shifts of the more intense peaks are presented (NH substitution 
two bonds away). 


(b) | The chemical shift of N-acetyl is 22.74 ppm. 
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The structures of the nonamer and the hexamer, namely the N- 


Figure 2.1 


terminal and C-terminal Type |, a-2 chain telopeptides. 
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Figure 2.2 The pH titration of the nonamer in aqueous solution. Only the NH 


protons are shown. The solution concentration was 13 mM. 
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Figure 2.3. The COSY spectrum of the nonamer. The fingerprint region is 
presented. The solvent is CD30H/H20/D20 (60/20/20 volume 
percent). The connectivities are labelled in the figure. The pH was 


5.06. The chemical shifts are listed in Table 2.2. 
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Figure 2.4a The low-field region of the ROESY spectrum of the nonamer under 


the same conditions as Figure 2.3. The contour levels are of 


opposite sign with respect to the diagonal peaks. 
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Figure 2.4b The high-field region of the ROESY spectrum of the nonamer 
under the same conditions as Figure 2.3. The contour levels are of 


opposite sign with respect to the diagonal peaks. 
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Figure 2.5a The low-field 'SC spectrum of the nonamer as a function of pH. 


CH and CH3 carbons are phased positive whereas CHo carbons 


are phased negative. The data are presented in Table 2.3. 
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Figure 2.5b The high-field 1'SC spectrum of the nonamer as a function of pH. 


CH and CHg3 carbons are phased positive whereas CHo carbons 


are phased negative. The data are presented in Table 2.3. 
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Figure 2.6a The low-field region of the 'H-'SC correlation spectrum of the 
nonamer in CD30H/D20/H2O0 (60/20/20 volume percent) at 28.0 
mM and pH 5.13. The chemical shifts are presented in Table 2.3. 
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Figure 2.6b The high-field region of the 'H-'3C correlation spectrum of the 
nonamer in CD30H/D20/H20 (60/20/20 volume percent) at 28.0 


mM and pH 5.13.spectrum. The chemical shifts are presented in 


Table 2.3. 
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Figure 2.7. The results of the proton NMR data in CD3OH/water, including the 
ROE data, of the nonamer, together with the two proposed 
structures, a type | B-turn and the extended structure. The 9UNH-Ha 
coupling constants are in Hz. The solid square indicates a low 
temperature dependence of the NH protons (3.2 ppb/K), whereas 


the open squares indicate a high temperature dependence (>5.2 


ppb/k). 
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74 88 
Alignment of N- (A) and C-terminal (B) a-2 chain telopeptides with 
sequences?3 of triple-helices displaced axially by 1D (234 
residues) and 4D (936 residues). Hydroxylysine is represented by 
Z. Only potentially interacting (charged and large hydrophobic) 
residues are shown. The a-2 chain is assumed to be in the middle 
(reference) position in the triple-helix?4. | The solid lines show 


potential favourable charge-charge interactions. 
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CHAPTER 3 


Conformational Analysis of the Type II and Type III 
Collagen a-1 Chain C-Telopeptides by 1H NMR and 
Circular Dichroism Spectroscopy 


Introduction 

Collagen is the protein that gives mechanical strength to cartilages, 
bones, skin, tendons, ligaments and teeth. The short N- and C-terminal non- 
triple-helical extensions ("telopeptides") present on each of the three a-chains of 
the fibrillar collagen monomers play key roles in the assembly of the collagen 
fibrils which give connective tissues their mechanical strength. Proteolytic 
removal of telopeptides (which range from 6 to 27 amino acid residues long, 
depending on the particular a@-chain and collagen type) either abolishes the 
ability of the collagen monomers to assemble or leads to fibrils with abnormal 
morphology. From these results it appears that the N-telopeptides of type | 
collagen direct linear growth of the fibrils', while C-telopeptides promote lateral 
growth2:3, Fibrillogenesis is believed to involve their interaction in the growing 
fibril with a specific region on the triple-helix of an adjacent monomer. Protein- 
protein interactions are by nature conformation-dependent and we have recently 
completed a series of NMR studies on the conformations in solution of all the N- 
telopeptides of type |, Il and III collagens, which are most abundant collagen 
types in the body, as well as the type | C-telopeptide*+®. The N-telopeptide of 
type | collagen forms a significant proportion of f-turn in an environment of 
reduced dielectric constant (60% methanol) which is thought to mimic conditions 


within the growing fibril6. Under the same conditions the type II collagen N- 
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telopeptide forms a fused {/y turn and the type III collagen N-telopeptide forms a 
"nascent helix"8. 

Type | collagen is found in bones, teeth, skin, tendons and ligaments, 
whereas type II collagen is found in adults only in hyaline cartilage, where it is 
the dominant form, and in vitreous humour. /n vivo, it is present as thin fibrils 
which are critical components in the physical integrity of cartilage. The N- and 
C-telopeptides of human type Il collagen are 19 and 27 residues long, 
respectively9. Type Ill is the second most abundant form of collagen in soft 
tissues other than cartilage. It is prevalent in tissues where distensibility is a 
major requirement for function, such as blood vessel wails, lung parenchyma, 
intestine and uterine wall!9, /n vitro, type Ill collagen has been shown to form 
thinner fibrils than type | and to reduce the diameter of copolymeric fibrils 
containing predominantly type |.'!' /n vivo, it appears that the fibrils in many 
tissues contain both types | and III1*. The type III collagen monomer consists of 
three identical a-1 chains. Sequence data are available for the bovine type III N- 
telopeptide, which is 14 residues long!S and the human'4 and chick!> C- 
telopeptides which are 25 and 23 residues long, respectively. 

While there have been attempts to predict secondary structures of 
collagen telopeptides'® there exist few supporting physical data. To 
complement our previous studies, the present paper presents the results of our 
two-dimensional NMR and circular dichroism studies of the type II and type Ill a- 


1 chain collagen C-telopeptides. 


Materials and Methods 


Materials and sample preparation 
Both peptides, the bovine type ll NAc-GPGIDMSAFAGLGPRE 


KGPDPLQYMRA and the chick type Ill NAc-GGGVASLGAGEKGPVGYGYEYR, 
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were synthesized on a Biolynx™ 4175 manual peptide synthesizer using Fmoc 
amino acid derivatives (Pharmacia Canada Inc., Baie d'Urfe, Quebec). The raw 
materials were purified by reversed-phase HPLC on a Vydac TP218 column 
(The Separations Group, Hesperia, CA). The resulting white products were 
judged to be pure, based on their amino acid compositions and NMR spectra. 

The peptide content (71.8 %) of the type II collagen C-telopeptide was 
determined by spectroscopic analysis for the tyrosine content of an aqueous 
solution containing 0.300 mg peptide preparation per mL at pH 5.4. The near- 
UV spectrum of the solution in a 1 cm pathlength cell was recorded on a 
Beckman Model 25 Spectrophotometer, the concentration of tyrosine being 
calculated from the absorbance at 275 nm and the molar extinction coefficient, 
1420 L/mol cm, of N-acetyl tyrosine methyl ester. 17 

A 4.2 mg sample of the type II C-telopeptide was dissolved in 140 uL of 
H5O. The pH was measured in the 5 mm NMR tube with a Cole-Parmer C-5990 
electrode and adjusted to 5.4 by adding small aliquots of 0.2 M NaOH. The rest 
of the solvent, 560 wl of CD30H (Merck, Sharp and Dome, MD-67, 99.9% D), 
was then added and degassing of the solution with argon was performed prior to 
sealing. The final concentration was ca 2.1 mM in a mixture of 80% CD30H and 
20% Ho0O. 

A 4.9 mg sample of the type III C-telopeptide was dissolved in 140 uL of 
H>O. The solution was filtered and transferred to a 5 mm NMR tube and the pH 
was adjusted to 5.5 as described above. The rest of the solvent, 560 uL of 
CD3OH, was then added and the solution was degassed with argon prior to 
sealing. The final concentration was 3.0 mM in a mixture of 80% CD30H and 


20% H20. 
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Circular dichroism spectroscopy 

The CD spectra shown in Figure 3.1 were recorded at 10 + 0.05 °C ona 
Cary model 60 spectropolarimeter with a model 6001 circular dichroism 
attachment with thermostated cell holder. The instrument had been modified to 
eliminate possible artifactual signals on passing through intense absorption 
bands. Slits were programmed for 1.5 nm bandwidth at each wavelength. The 
instrument was calibrated with dextro-10-camphor-sulfonic acid using the 
procedure suggested by Chen and Yang.'8 The path length, 0.05 to 2 cm 
depending upon wavelength range, was dictated by the absorbance of the 
solution. Mean residue ellipticities, [O],,,,(deg cm? dmol-'), were calculated in 
the usual fashion using the mean residue weight of 106, calculated from the 


molecular weight of the peptide. Each spectrum is the average of two scans. 


NMR spectroscopy 
All NMR experiments were carried out on a Varian UNITY500 


spectrometer using a proton-selective 5 mm probe with a 90° proton pulse length 
of 9.0 us at a transmitter power level of 60 dB. The 90° pulse length was 
checked at the beginning of every NMR session and the temperature was 
always controlled to +0.1°C. For data collection and processing, VNMR software 
(version 3.2 and 4.1) was used on Sun Spare 4/330 and Sparc 4/470 
workstations, respectively. 2D spectra were acquired non-spinning, whereas for 
one-dimensional experiments the spinner was turned on. All chemical shifts are 
measured relative to the undeuterated fraction of the methyl group of CD3OH at 
3.30 ppm. The temperature dependent coefficients of the amide protons were 
calculated by analyzing the chemical shifts at 0.0, 5.0, 10.0, 15.0, 20.0 and 
25.0°C (type II) and 0.0, 10.0, and 20.0°C (type Ill) by means of a linear 
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regression. Unless otherwise indicated, all data are reported at 0.0°C for the 
type II N-telopeptide and 10.0°C in case of the type III N-telopeptide. 

All two-dimensional NMR experiments'9 were carried out in the phase- 
sensitive mode by using the hypercomplex technique (States-Haberkorn-Ruben 
method.)*9 The decoupler and transmitter offsets were always set equal, a 
sweep width of 5000 Hz was used in both dimensions and typically 4K data 
points in t2 and 512 experiments in t1 (zero-filled to 1K) were acquired. The only 
significant exceptions from this scheme were the TOCSY?! experiments where 
800 t1-increments were acquired and an 8K by 2K data matrix was analyzed. All 
two-dimensional experiments were preceded by 8 dummy scans but none were 
applied between individual t1-increments. The preacquisition delay "alfa"?2 was 
carefully and individually adjusted for every 2D experiment. First point 
distortions caused by analog filters¢? were compensated by reducing the first 
point of each FID and of each t1-interferogram by empirically determined 
values.22 No further correction routines were necessary. 

The TOCSY experiments were carried out by using the basic pulse 
sequence proposed by Bax.°4 Water suppression was achieved by solvent 
presaturation during the relaxation period. A 2 ms trim pulse preceded the 80 
ms MLEV-17 spin-lock (field strength 7.0 kHz). Connectivities from the amide 
proton to H,(Lys) were readily observed after 8 scans were accumulated. 
Squared sine bell functions were used in both dimensions. The width of the 
functions was ca. 0.7 times t2 or t1, respectively, shifted by approximately x/2 in 
both dimensions. 

The NOESY25.26 experiments were carried out with the 90°-t1-90°-mix- 
90°-FID pulse sequence and 1.5 s solvent presaturation. A random variation of 
10% was applied to the mixing time. In the case of the larger peptide (type II), 


NOESY at six different mixing times, namely 50, 150, 250, 350, 450 and 550 ms 
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were recorded without random delay, and 32 scans per t1-increment were 
acquired, resulting in negative nuclear Overhauser effects. For the smaller 
peptide (type Ill), the NOESY experiments were carried out with 40 scans per t1- 
increment at a mixing time of 200 ms. After data processing with x/2 shifted 
squared sine bell functions large numbers of correlations were observed. 

The cross-peak volumes of the NOESY data sets obtained from the type 
Il telopeptide were quantified at mixing times of 50, 150, 250, 350, 450 and 550 
ms to check for linear build-up of the nuclear Overhauser effects. The cross- 
peak volumes were quantified in the following way: (i) cross-peaks were 
integrated and carefully checked for contributions from baseline distortions, such 
as t1 and t2 ridges, by integrating and, if necessary, subtracting the integral of 
nominally empty areas perpendicular to the actual cross-peaks; (ii) whenever 
possible, the volume was measured in both cross-peaks, above and below the 
diagonal, and the two values were averaged to minimize slight phase 
imperfections and other minor artifacts; (iii) where applicable, the volumes were 
normalized for the contributions of more than two protons to the cross-peak by a 
factor (2N3N,)/(Nat+Np), where N, and Ny are the number of protons responsible 
for the diagonal peaks a and b, respectively.” (iv) for the type II telopeptide the 
BB' interactions of residues Asp®, Asp29 and Tyr@4 were used as reference (1.8 
A); for the type Ill telopeptide the £f' interactions of the Tyr 17, 19 and 21 
moieties were used as a reference (1.8 A) to calculate the inter-proton distances. 
It was not possible to establish the degree of attenuation of the exchangeable 
amide protons in a precise manner because of spectral overlap. Qualitatively 
speaking however, the loss of intensity was small, as evidenced by the few 
cases where a precise integration and subsequent comparison with non- 


exchangeable reference protons was possible. The absence of correction 
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factors for the exchangeable protons renders the distance constraints longer, so 


that no unnaturally close proximities are created. 


Results 


Circular dichroism spectroscopy 

The comparison of CD spectra of the type II collagen C-telopeptide in two 
solvents is presented in Figure 3.1 from which it can be seen that the peptide 
has a virtually disordered conformation in water as solvent. However in 
CH30H/H»20 (80/20), it adopts a degree of ordered conformation, the CD profile 
of which is given in terms of the difference spectrum displayed in the inset to the 
figure. The features of the difference spectrum suggest, perhaps, 8% a-helix 
based on the relationship 

% a-helix = - (Mean Residue Ellipticity at 222 nm) / 29700, 

where - 29700 deg cm? dmol-! is the estimated value of the mean residue 
ellipticity for 100% helix formation of a sequence of 10 amino acid residues using 
the rationale and equation 5 of Chang et al.¢8 A subsidiary conclusion is that the 
peptide is not aggregated in CH30H/H2O (80/20), since decreasing the peptide 


concentration 5-fold had no significant effect on the CD spectrum. 


NMR spectroscopy 


The resonance assignments (Tables 3.1 and 3.2) of both telopeptides are 
based on TOCSY experiments2! and sequential NOEs of the type a(i)-NH(i+1), 
NH{(i)-NH(i+1) and £(i)-NH(i+1).29 


Type II collagen a-1 chain C-telopeptide 


The assigned resonances of the type II collagen a-1 chain C-telopeptide 


are summaried in Table 3.1, which also includes the temperature dependence 
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coefficients and coupling constants of the NHs (where available), and the 
relevant section of the TOCSY recorded with a mixing time of 80 ms shown in 
Figure 3.3. Vicinal coupling constants of the type SUH were measured in the 
one-dimensional spectra, at different temperatures as required to minimize 
spectral overlap. Whenever possible, the values of the coupling constants were 
checked over the temperature range -5 to 25°C, and changes of more than 0.3 
Hz were not found, an indication that the conformation does not change 
significantly. In all cases, the chemical shifts of the amide protons varied linearly 
with temperature with correlation coefficients better than 0.98, indicating that 
major conformational changes are not present in the temperature range under 
study. Several amide protons were observed to have smaller temperature 
dependence coefficients (below 3.0 ppb/K), which is a indication for a possible 
intramolecular hydrogen bond, or inaccessibility to solvent for steric reasons. 

NOE build-up curves for several pairs of proton in the type II collagen a-1 
C-telopeptide are depicted in Figure 3.3. One can see that, for the proton pairs 
whose distance could not change with the local conformation, the NOE 
intensities reached their highest point near the mixing time of 200 ms; while for 
ones whose distance could be changed with local conformation, the maximums 
were reached later near 450 ms. These differences could be caused by the 
combination of the rapid random motion of the molecule and the different 
correlation time at different parts of the molecule. 

NOE information for the type II collagen a-1 chain C-telopeptide was 
extracted from a NOESY spectrum with 10% random delay and a mixing time of 
150 ms. The observed effects are summarized schematically in Figure 3.5 (top). 
Basically, all the sequential NOEs, including NH(i)-NH(i+1), were observed along 
the chain, except where spectral overlap precluded measurements. Medium 


range and long range (|i-j| >= 2) NOEs, which are the most important indication 
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for the folded structure, were not found under careful examination, especially 
near the place where the lower temperature dependence coefficients were 


observed. 


Type Ill collagen a-1 chain C-telopeptide 


The assigned resonances of the type III collagen a-1 chain C-telopeptide 
are summarized in Table 3.2 and the relevant sections of the contour plot of the 
NOESY spectrum recorded at 200 ms mixing time are depicted in Figure 3.4. 
One can see that some amide-amide interactions overlap completely at 10°C. 
Recording the spectra at different temperatures does not solve this problem as 
the amide temperature coefficients are not significantly different. This is not 
surprising in view of the size of the molecule and its relatively unstructured 
nature. NOESY spectra were carefully examined for intraresidue (i = j), 
sequential (|i-j] = 1), medium range (|i-j] <= 4) and long range (|i-j] > 4) NOEs and 
the observed effects are summarized schematically in Figure 3.5 (bottom). 
Essentially an uninterrupted sequence of NH{(i)-NH(i+1) cross-peaks was 
observed. The continuous NH/NH NOE pattern is seemingly interrupted in 
places where spectral overlap prevented a quantification of the cross-peaks, and 
of course, around the Pro!4 moiety. From an overview of the spectroscopic data 
presented in Table 3.2 and Figure 3.5 it is evident that effects associated with a 
non-extended conformation are only found around residues 8 to 11. 

The temperature coefficients of the amide protons measured over a 20°C 
range are included in Table 3.2. Linearity was observed in all cases which is a 
good indication that no major conformational changes took place. This 
conclusion is confirmed by the NH-H,q coupling constants (where available) 
which also did not change by more than 0.1 Hz over the entire temperature 


range. The temperature coefficient of the amide proton of Glu’! (2.0 ppb/"C) is 
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clearly indicative for shielding from solvent whereas all the other coefficients in 
the range from 3.9 to 6.0 ppb/°C demonstrate essentially unrestricted 


accessibility to solvent. 


Discussion 

The NMR data presented in Tables 3.1 and 3.2 and the graphic summary 
thereof (Figure 3.5) indicate that the conformation in solution of the two 
molecules under study is somewhat different. As indicated by the parameters 
used for secondary structure determination by NMR (nuclear Overhauser 
enhancements, amide proton exchange rates39 and/or amide temperature 
coefficients), both peptides undergo conformational averaging on the NMR time 
scale. With a few exceptions,3’ this phenomenon has so far prevented 
quantitative conclusions about conformations of short linear peptides in solution. 
Unless the degree of conformational freedom is significantly reduced by disulfide 
bonds or binding to metal atoms,%2:93 the spectroscopic indicators of secondary 
structure34;35 are population-weighted averages over all conformations present. 
The non-linearity of the averaging process%® renders it essentially impossible to 
assess to what extent individual conformers are present. Only very extensive 
molecular dynamics calculations on time-scales of several nanoseconds can 


possibly solve this impasse.°7 


Type II collagen a-1 chain C-telopeptide 


Based on the NMR experiments, the conformation of the type Il C- 
telopeptide is mostly extended. In two regions where smaller temperature 
dependence coefficients for the NH protons were observed, which are one of the 
necessary indications of folded structures in small peptides, folded structures 


cannot be deduced because of the lack of other evidence. In the region of 
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residues Ser’ to Ala!°, small temperature dependence coefficient for the NH 
proton of Ala!9 and smaller coupling constant for the NH proton of Ala® are 
consistent with a B-turn structure, but the medium range H,(i) - NH(i+1) NOE, 
generally regarded as the most reliable indication for turn formation3!, was not 
observed. In the C-terminal, smaller temperature dependence coefficients were 
found for three NH protons, but severe spectral overlaps prevented further 
conformational analysis over that region. These smaller temperature 
dependence coefficients can only be interpreted in terms of local shielding from 
solvent for sterical reasons (large hydrophobic side-chains). 

The apparent 8% a-helix in CH30H/H2O (80/20) may be interpreted in 
terms of an average taken over a set of interconverting subpopulations, some of 
which have a random structure while others have a degree of ordered 
conformation, although not necessarily helical. Alternatively, it is at least 
conceivable that on the average 92% of the molecules are completely random, 
while 8% are completely helical. This result is a confirmation of the result from 
the NMR study. Combining these two studies, we conclude that there might be a 
very small percentage of folded structure for the molecule, most likely in the C- 


terminal end and between residues 7 and 10. 


Type III collagen a-1 chain C-telopeptide 
The conformation of the type Ill C-telopeptide is mostly extended (Table 


3.2 and Figure 3.5), except for the Gly8 - Glu!! segment where some indications 
of turn formation were observed. First and foremost, a very low (in comparison 
to other values, see Table 3.2) temperature coefficient was observed for 
NH(Glu11), indicating significantly reduced exchange with the bulk solvent. In 
view of the absence of any large hydrophobic side-chains around Glu", this is 


most likely due to hydrogen bonding. This conclusion is confirmed by a medium 
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range nuclear Overhauser effect between the a-proton of Ala? and NH(Glu"’). 
Unfortunately, the conformational analysis is severely hampered by four 
difficulties: (i) the presence of two Gly moieties in the putative turn limits the 
number of possible interactions drastically because of the absence of side-chain 
protons; (ii) vicinal NH-Hg coupling constants are basically meaningless for Gly 
as two a-protons are present; (iii) Gly? and Gly'° have identical H, and NH 
chemical shifts making the quantification of relevant NOEs other than the 
medium range effect mentioned before impossible; and (iv) the chemical shift 
difference between NH(Gly'9) and NH(Glu'!) is so small, that any NOEs 
between the two protons would be too close to the diagonal to be measured. 
However, the observation of the medium range H,(i) - NH(i+2) NOE, generally 
regarded as the most reliable indication for turn formation, makes the conclusion 
about the existence of a turn fairly certain. Although hydrogen bonding is not a 
prerequisite for B-turn formation®8, it appears to be the major stabilization factor 
in the present peptide. The decision whether the turn is type | or type II is 
difficult in view of the problems with spectral overlap. As the main difference 
between the two turns is the 180° rotation of the amide linkage between the 
second and the third amino acids of the turn (Ala? and Gly!9), few spectroscopic 
features are different in the two turns. The main differences are the very short 
distance H,,(Ala2) - NH(Gly1°) of 2.2 A in p-II (3.4 A in B-I) and the extremely 
long distance NH(Ala2) - NH(Gly19) of 4.5 A in £-Il (2.6 A in B-I, idealized turn 
values from reference 29). As both kinds of interactions are overlapped by other 
NOEs, no firm conclusion can be made based on this information alone. In all 
our studies to this date involving B-turns, we always found type | B-turns which is 
not surprising in view of their 2-3 fold predominance over type II.°9 In all those 
cases, it was found that the temperature coefficient of NH(i+2) was also reduced, 


although not as much as NH(i+3). This can be understood in view of the fact 
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that in the B-I turn the NH proton points towards the inside, whereas in the f-II 
case it is pointing to the outside of the turn where the exchange with the solvent 
would be unhindered. In the present case there is no reduction of the 
temperature coefficient of NH(Gly!°), and hence there is the potential of the turn 
being type Il. When analyzing the sequence in terms of turn probabilities,°9 a 


high probability for a B-Il turn at residues GAGE is found. 
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Table 3.1 14H Chemical Shifts@, Coupling Constants? and NH 
Temperature Coefficients© of the Type II Collagen a-1 Chain C- 
Telopeptided 

NH oe Hee Ae other H JNH-a NH temp. 
coeff. 
Gl S.o6uee 4.17 5.4/5.4 7.0 
3.97 
2s - a SOme eo ero gta e07 1,940 23.04 3,56 --- 
G3 8.73 3.96 6.0/6.0 5.6 
3.81 
\4 7.94 4.18 1.84 fda tela 5:0.82 4.2 
0.89(CH3) 
DS 8.53 461 2.82,2.64 7.0 5.8 
M& B68. 4:62 | 2:25,2.00 2.67,2.53 e: 2.00 6.4 6.2 
Ss? 8.65 418 3.96,3.91 5.0 3.4 
Ag 8.33 4.09 1.18 4.6 6.6 
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pi4 : 438. 2.23.2.23 2.071194 0:3.643.58 =: 
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e:NHe 7.03 
E16 88.44 4.26 2.04,2.04 2.32,2.32 6.3 2:3 
Kitano Seer e4ig4Wen1sa711 70) @14611746™ OF1'62'1.62 77.0 4.6 
€: 2.90 2.90 
Gis 8.21 4.11 6.5 4.8 
4.04 
p19 : 4.41 2.23,2.23 2.07,1.99 6:3.633.56 a : 
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7.80 4.02 23.0 2alGntee.O 32.03 SNHo: 5.4 0.8 
72(3,0:03 

7.82 4.45 O22) ads aL FEN Cw aE TE 4.6 
3970.12 

Pegs )8) 4.43 2.46,2.38 2.10,1.97 €:2.00 fT 

fU 4.37 1.88,1.76 1.63 6: 3.16 3.16 3.3 
NH: 7.60 

7.94 4.16 1.34 53 


Chemical shifts are reported in ppm relative to the methyl group of 
CD30H at 3.30 ppm downfield from TMS. 

Coupling constants in Hertz. Whenever possible, they were measured in 
one-dimensional spectra. When it was not available from one- 
dimensional spectra, it remained unknown. 

The temperature coefficients of the amide protons are listed in - ppb/*C. 
They are the result of a linear regression analysis of the chemical shifts 
measured at 0.0, 5.0, 10.0, 15.0, 20.0, and 25.0°C and accurate to within 
+/- 0.3 ppb/°C. 

The data were recorded at 500 MHz in CD3OH/H20 (80/20 v%) at a 
concentration of 2.1 mM. The pH was 5.4 and the temperature was 0.0 + 
ORS: 

Pairs of geminal protons x (x = B, y, 6, « and a(G)) are not assigned 
stereospecifically. When two distinct lines were observed, the larger 


chemical shift was arbitrarily assigned to x and the smaller to x’. 
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Table 3.2 14H Chemical Shifts4, Coupling Constants? and NH 
Temperature Coefficients® of the Type Ill Collagen a-1 Chain 


C-Telopeptided 


Nab RRS: He et H, & otherH  Jng NH temp. 
coeff. 
Gi 
er 
GS 8.42 ~— 3.97 
3.92 
V4 8.15 4.03 rasa 0.96,0.96 6.5 5.7 
A° 8.41 4.29 1.39 6.0 5.8 
Someee7.00.. 4.37  3,88.3.81 7.0 3.9 
ize Shige 4 G3) 81h /151,69 1.60 5:0.910.86 7.0 3.9 
Gs 8.42 ~—s 3.92 
3.92 
Romeo 4.31 1.39 5.5 4.6 
GiOSs 42 = 33.92 2x5.8 5.4 
3.92 
Elly 6346! «4225 we2.07,199ly <2:28;2:28 6.0 2.0 
emarsswe4 37. 1.89.1.72 1:44:1:44 0: 1.631.638" =47.5 5.0 
€: 2.92 2.92 
Gi3. 812 = 4.02 2x5.4 4.1 
4.02 
pi4 === 4.40 2.13,1.94 1.94,1.94 6: 3.55 = Ee 
VIS 8.21 4.08 B08 0.91,0.88 fee 6.0 
G6 840 3.92 2x5.5 5.5 
a4 
Wiens 27688 4.39 3,02,2.85 2,6: 7.03 6.0 5.4 
S57 0asa 
Gi8 856 3.96 2x6.0 4.7 
3.61 
Viwmees 02008 4.43 93,02 °2.85 267.01 7.5 5.6 
3,5: 6.70 
pecmes 77a 4i9 ) 1:91,1.88 2.16,2.16 6.0 5.3 


oF 


| 


8.02 4.53 3.08 2.85 eo TASS fis 4.3 
3,0; 6:73 

7.69 4.18 1-65,1¢08— 1,54,1.54 70; 3314 314 FON) 5.0 
e: 7.48 


Chemical shifts are reported in ppm relative to the methyl group of 
CD30H at 3.30 ppm downfield from TMS. 

Coupling constants in Hertz, measured in one-dimensional spectra. 

The temperature coefficients of the amide protons are listed in ppb/°C. 
They are the result of a linear regression analysis of the chemical shifts 
measured at 0.0, 10.0 and 20.0°C. 

The data were recorded at 500 MHz in CD30H/H90O (80/20 v%) at a 
concentration of 3.0 mM. The pH was 5.5 and the temperature 10.0 + 0.1 
(Gy, 

Pairs of geminal protons x (x = f, y, 5, € and a(G)) are not assigned 
stereospecifically. When two distinct lines were observed, the larger 


chemical shift was arbitrarily assigned to x and the smaller to x’. 
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Figure 3.1. Circular Dichroism spectra of the type II collagen a-1 chain C- 


telopeptide at 10 °C. For experimental details see "Materials and 


Methods". 
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Figure 3.2 


The contour plot of a phase-sensitive 500 MHz TOCSY experiment 
of the type II collagen a-1 chain C-telopeptide at a mixing time of 
80 ms. The concentration of the sample was 2.1 mM at pH 5.4 in 


CD30H/H90 (80:20), temperature 0.0°C. 
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Figure 3.3 


(a) - D°(NH) 


P*(c) - G3(NH) 


“\L22(NH) - Q25(NH) 


0 100 200 300 400 500 600 
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Plot of several build-up curves of the pairs of protons in the type II 
a-1 C-telopeptide indicated in the plot. The concentration of the 
sample was 2.1 mM at pH 5.4 in CD30H/H90 (80:20), temperature 
0.0°C. NOESY without the random delay at different mixing times 
of 50, 150, 250, 300, 350, 400, 450 and 550 ms were used. These 
experiment data were processed with identical parameters. The 


number for the intensity is arbitrary. See text for detail. 
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Figure 3.4 


Two sections of a 500 MHz phase-sensitive NOESY experiment of 
the type III collagen a-1 chain C-telopeptide at a mixing time of 200 
ms. The concentration was 3.0 mM at pH 5.5 in CD30H/H20 
(80:20), temperature 10.0°C. Bottom: the NH/NH region; top: 
H,/NH region (for clarity, only the peaks pertinent to the B-turn 


conformation around residues G8 to E11 are labeled). 
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Figure 3.5 


The amino acid sequences of the type II (top) and type III (bottom) 
collagen a-1 chain C-telopeptides with unique residues underlined. 
The observed sequential and medium range NOEs are indicated 
schematically by solid bars. The height of the bars is an 
approximation for the strength of the NOEs. Where proline 
moieties are present, the Ha(i) - Has'(i+1) instead of the H(i) - 
NH(i+1) interactions are listed. NOEs shown with dashed lines are 
likely to be present but could not be quantified due to almost 
identical chemical shifts of the protons. The temperature 
coefficients of the amide protons are shown with a solid square for 


values below 3.0 ppb/°C. 
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CHAPTER 4 


A Sequence-Dependent 1H NMR Study on the Formation 
of 6-Turns in Tetrapeptides Containing Charged 
Residues* 


Introduction 

The molecular mechanism involved in the folding of peptides and proteins 
remains one of the important questions in structural biology’. Recent studies2-5 
clearly demonstrate the ongoing interest in the factors favoring formation and 
stabilization of secondary structures at an early stage of the folding process. In 
particular, the observation of a stable B-turn in short linear peptides is important 
since these structures have been postulated to play a significant role in initiating 
protein folding®8. A turn is an important structural feature in peptides and 
proteins and is the site in the polypeptide chain where the polypeptide direction 
is reversed9:19, In the case of the f-turn, a hydrogen bond may be formed 
between the carbonyl oxygen of residue i and the amide proton of residue i+3. 
Even in short linear peptides, a beta turn can be stable, as observed for the 
pentapeptide Tyr-Pro-Gly-Asp-Val which is 50% beta turn at 5°C?. This 
conformation is in equilibrium with an extended conformation in water. The 
authors propose that it is the local amino acid sequence in the folded protein that 
determines the f-turn conformation because it is observed in short 
pentapeptides. Consequently, such a sequence is sufficiently stable so that it 
can initiate the direction of the folding pathway. The conformational space is 


significantly restricted by reverse turns in the folding polypeptide chain and this 


* A version of this chapter has been published. Liu, X., Scott, P. G., Otter, A. & Kotovych, G. 
(1992), Biopolymers 32, 119-130. 
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allows distant sections of a polypeptide to come into close proximity2:3. 

In the present study, the importance of side-chain charge interactions in 
the formation of B-turns is studied. The tetrapeptide NAc-DEKS-NH> has been 
previously studied'!. It forms a type | beta-turn which has been observed in the 
isolated tetrapeptide as well as in the type | collagen a-1 chain N-telopeptide!2. 
To study the importance of the side chains (their length, ionization state and the 
order within the sequence), the following protected NAc-tetrapeptide amides 
were synthesized: NEKS, EEKS, DDKS, DQKS, NQKS, DERS, NERS, EERS, 
DDRS, NDRS, DQRS and DKES. In addition, based on the statistical predictions 
of Wilmot and Thornton'3, the peptides NPDM, NSDM and NDDS were studied. 


Materials and Methods 

The tetrapeptides in this study consist of two series. The first series is 
related to our previous study of the type | collagen a-1 chain N-telopeptide (16- 
mer)'2 and the tetrapeptide DEKS"! and includes NEKS, EEKS, DDKS, DQKS, 
NQKS which have the same residues at positions 3 and 4 as in DEKS, but have 
different side-chains and charges for residues at positions 1 and 2; DERS, 
NERS, EERS, DDRS, NDRS, DQRS where K has been replaced by R at position 
3; and DKES which is the variant of DEKS obtained by switching the second and 
third residues. The second series include NSDM, NPDM and NDDS. These 
molecules were included in the present studies in view of the results of a 
statistical analysis and predictions of f-turns in proteins'!3. The sequence NPDM 
is composed of the four amino acids that are among the most often found in 
positions 1 through 4 in type | B-turns, and therefore could potentially serve as a 
model compound for the present investigation. Because of the absence of an 
NH proton at the important position 2 in NPDM, we included NSDM, the next 


probable f-turn forming tetrapeptide in the sequence. NDDS was selected since 
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it has the highest probability of forming a type | B-turn without the possibility of 
attractive charge interactions. All peptides were synthesized as the N-acetyl- 
amides. DEKS was from Alberta Peptide Institute, University of Alberta. NQKS, 
NPDM and NSDM were from Multiple Peptide Systems, San Diego, CA. All other 
peptides were synthesized on an LKB Biolynx!M 4175 solid-phase peptide 
synthesizer using Fmoc-amino acid derivatives (Pharmacia (Canada) Ltd, Baie 
d'Urfe, Quebec). All peptides were purified by reversed-phase HPLC on a Vydac 
TP218 column (The Separations Group, Hesperia, CA). Purity was confirmed by 
determination of amino acid composition. 

The NMR samples were prepared by dissolving the purified solid material 
in ca. 0.2 mL of water (H20). The pH was then adjusted by adding small 
aliquots of diluted NaOH or HCl, followed by adding the rest of the solvent, which 
consisted of 170 uL CD30H and 130 uL CD30D to make the solvent a 60/40 
methanol/water mixture. Before the NMR experiments, the samples were 
degassed with argon. The concentrations of the samples were between 3.9 and 
6.8 mmol/L and the pH of the samples was in the range of 5.5 to 6.6. Whenever 
the same peptide was studied at different pH, the second sample was prepared 
in the same way. For several samples, spectra at different concentrations were 
compared and the chemical shifts were found to be independent of the 
concentration, indicating the absence of aggregation or other structural changes. 

All experiments were carried out on Bruker AM300 and AM400 NMR 
spectrometers at ambient temperature (295K), except for the temperature 
studies. Three probes were used, namely a broadband probe on the AM300, a 
selective 1H probe and an inverse probe on the AM400. Since the inverse probe 
has a much higher sensitivity than the other two, it was used to carry out all of 
the ROESY experiments as well as experiments on the more dilute samples. 


The temperature studies were all carried out on the AM400. The temperature 
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dependence of the amide protons was calculated by comparing spectra 
recorded at different temperatures in the range from 283K to 300K and a 
maximum variation of +/- 0.3 ppb/K was observed for the temperature 
coefficients. Each temperature was calibrated by inserting a thermocouple 
inside an NMR tube, filled with the same mixed solvent, into the magnet before 
and after the experiments. When the temperature dependence was calculated, 
the calibrated and averaged value for each temperature was used. For 5 
peptides (DERS, NEKS, DDRS, DDKS, DKES) both the selective and the inverse 
probe were used for the temperature study, and the results are averaged values 
over 8 temperatures. For the remainder of the peptides the results were 
obtained from spectra at 4 temperatures using only the inverse probe. 

The FID acquisition was controlled by an ASPECT 3000 computer using 
1987 or 1989 Bruker DISNMR software. For data processing the same 
computer and software were used, except that the Fourier transformations were 
carried out on the ASPECT 1000 computer with an array processor. The 
chemical shifts were measured relative to CD30H (Merck, Sharp & Dome, MD- 
67, 99.2% D) at 3.30 ppm (1H). 

One-dimensional proton spectra were recorded with a sweep width of 10 
ppm and 16K (zero-filled to 32K) data points. The presaturation of the solvent 
peak(s) was performed during a 3s relaxation delay at the lowest possible 
decoupler power level to avoid saturation transfer and possible wipe-out of the 
exchangeable protons. The frequency of the decoupler was changed every 0.3s 
between the two decoupler positions, one at the OH-signal and another at the 
signal of the undeuterated fraction of CD3OH at 3.30 ppm. To account for the 
big difference in intensity between the two solvent resonances, the former signal 
was saturated twice as long as the methyl group. Usually a total of 64 scans 


were accumulated, and the resulting FID's were resolution enhanced by means 
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of a Gaussian multiplication with a line broadening factor of typically -3 Hz. 
Two-dimensional NMR_ experiments, i.e. correlated spectroscopy 
(COSY) '4 and spin-locked NOE (ROESY or CAMELSPIN)15.16, were employed 
for the assignment of the peptide spectra and for the detection of spectral 
features related to secondary structure. An example of the parameters used for 
the two-dimensional NMR techniques is given in Table 4.1. For different 
molecules, certain parameters such as sweep width, acquisition time and t4- 
increment were slightly different, since those parameters depend on the field 
strength of the spectrometer and were adjusted slightly in order to achieve the 
maximum resolution in both dimensions. The two-dimensional experiments were 
all carried out in the phase-sensitive mode using time-proportional phase 
increments(TPPI)'7._ Solvent suppression was achieved as described for the 
one-dimensional spectra except that the relaxation delay was 1.8s instead of 3s. 
In addition, continuous irradiation was applied to the water signal at all times 
except during acquisition'®. The phase constants were determined by using the 
second FID, which was multiplied by the weighting function, Fourier transformed, 
and then phased. The phase constants thus determined were applied to all 
spectra in the to domain. No phase correction was done in t, except for the 
ROESY experiments where a basic 2/2 phase correction was required. The 
weighting functions for both dimensions were sine-bell window functions. Shifted 
sine-bell functions were applied to obtain the best results in the fingerprint’ 
region of a COSY spectrum; on the other hand unshifted sine-bell functions were 
the better choice for the high-field regions of COSY spectra. The resulting two- 
dimensional matrix was usually slightly rephased in order to obtain the best 
possible phase in the area of interest. The two-dimensional proton spectra 
suffered somewhat from the appearance of t,-noise and ridges!9.29, but these 


problems were not serious enough to affect our analysis. 
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In the ROESY experiments the same conditions that were used in our 
previous study of the tetrapeptide DEKS'', were repeated here. The spin-lock 
conditions were achieved by a series of hard pulses of small flip angle@! and the 
spin locking time was set to 300 ms at an average field strength of 4.0 kHz which 
was found to be suitable for peptides of similar size in previous studies22. Since 
ROE cross-peaks show a non-negligible offset dependences, they often cannot 
be observed at the edge of a spectrum, irrespective of the internuclear distances 
involved. This is especially true for NH to NH interactions which are extremely 
important for the determination of secondary structures in peptides and 
proteins®4. Therefore, a ROESY experiment was performed with the transmitter 
set in the middle of the NH region if the NH-NH interactions were not observed 
with the transmitter set at ca. 5 ppm. When NH-NH interactions were observed, 
then the ROESY experiments were carried out at two additional mixing times 
(100 ms and 200 ms). Usually the ROESY spectra showed some baseline 
distortions around intense diagonal peaks. Therefore, the spectra were 
subjected to a baseline correction routine which fits a polynomial function of fifth 
degree to the baseline (ABS baseline correction), and the problem was 
eliminated almost completely in both dimensions. It is noteworthy that the 
ROESY experiments were almost free of any spurious resonances (identified by 
their phase relative to the diagonal) due to magnetization transfer between 
scalar coupled spins25:26, In the so-called "fingerprint" region@” no such effects 
were present, indicating suitable spin-locking conditions@'. 

Finally, the ROESY spectra were re-run on DEKS, DERS and NEKS at 
500 MHz on a Varian UNITY 500 NMR spectrometer, on which better sensitivity 
is available. The samples for these experiments (5 mM, pH 5.8) were prepared 
in the same way as described above. Spin-locking was achieved by a series of 


30° hard pulses, with two hard 90° pulses added at both ends of the spin-locking 
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period to compensate for the offset effect of the cross-peaks23. The strength for 
the spin-lock was set to 5.0 kHz and the spin-locking time to 300 ms. The 
sweep-width of the spectra was 5000 Hz and the transmitter was set on the 
water signal (ca. 5 ppm). Solvent suppression was achieved by presaturation 
during a 2s relaxation delay. The data were processed by using VNMR 3.1 
software which was run either on the SUN 4/330 or SUN 4/470 computer. The 
ROESY experiments of the three peptides were run and processed with exactly 
the same parameters in order to estimate the ROE intensities qualitatively. The 


plot of the NH-NH region for each of the three molecules is shown in Figure 4.1. 


Results 


(a) 1H resonance assignments 

The spectral analysis of the tetrapeptides is based on the results of the 
phase-sensitive COSY!” and ROESY'5.16 experiments. All spectra used for 
assignments were recorded with the transmitter set approximately in the middle 
of the spectral range (ca. 5 ppm). Since there was rarely spectral overlap in the 
two-dimensional spectra, the COSY experiments, combined with the sequential 
Har(i)-NH(i+1) ROEs?4 in the ROESY experiments, allowed the complete 
assignment of all the protons of the tetrapeptides without any difficulty. 

The assignments for all the peptides are summarized in Table 4.2, which 
also includes the results of a temperature study of the exchangeable protons for 
each tetrapeptide at nearly neutral pH together with NH-Ha coupling constants, 
which were obtained directly from the one-dimensional spectra. The data for six 


tetrapeptides that were studied at low pH are also presented in Table 4.2. 
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(b) Temperature coefficients of amide proton resonances 


The temperature dependence of the chemical shifts of the amide protons 
was determined over the range from 284K to 300K. In all cases, the chemical 
shifts varied almost linearly with temperature, indicating that major 
conformational changes are not present in the temperature range under study. It 
is believed that the temperature dependence of an amide proton reflects 
differences in the extent to which the backbone amide proton participates in a 
hydrogen bond, though not necessarily in a B-turn conformation. It is noticeable 
that the values for the backbone amide proton resonance of residue 4, which 
was Ser in most molecules, vary greatly between peptides, from 2.7 in DERS to 
6.4 in NDRS. A number of low temperature coefficients (below 4.0 ppb/K), which 
is indicative of shielding from the solvent, were observed, mainly for DERS, 
NEKS, DDRS, and DKES, probably because of hydrogen bonding. However, it 
has to be pointed out that since the temperature dependence of amide protons 
strongly depends on the solvent and the pH value, any quantitative comparison 
is difficult. The fact that small peptides in solution possess several 


conformations in equilibrium further aggravates this. 


(c) pH dependence of amide protons in selected tetramers 
In addition to DEKS, six tetrapeptides were investigated at pH values 


below the pKa of the Asp or Glu carboxyl groups. Upon lowering the pH, two 
interesting effects were observed for DERS and NEKS : (i) the chemical shifts of 
NH(E2) shifted to high field by more than 0.5 ppm; (ii) The NH-Ha coupling 
constant of E2 increased from 5.0 to 6.6 and 7.0, respectively. For other 
peptides we observed that the chemical shifts of NH(E) in the second and third 
position were subjected to a relatively large high-field shift of 0.3 ppm (EEKS and 
DKES) but only 0.1 ppm for the NH(E) in the first position (EEKS), «It is 
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interesting that the chemical shifts for NH(E)s in the second position were on the 
low field side of 8.7 ppm (EERS, EEKS, DERS, NEKS, NERS), while those for 
NH(E)s in the first or third position were around 8.5 ppm (EERS, EEKS and 
DKES). In comparing the chemical shifts of NH(E) and NH(D), NH(D) in the first 
position shifted to low field upon lowering the pH (DERS, DKES, DDRS and 
DDKS), and NH(D) at the second position shifted in the same direction but less 
in extent (DDRS and DDKS). 


(d) Use of the nuclear Overhauser effect to define conformations in solution 
Conventional NOESY spectra are inappropriate since the peptides of 
interest are of such a size that cross-peaks are either nonexistent or too weak to 
be reliably observed because the correlation time of these molecules at 300 and 
400 MHz is close to the rate at which the maximum possible NOE passes 
through zero28. An alternative approach for small molecules is the rotating 
frame NOE (ROESY) experiment. This experiment involves the use of a spin- 
lock following frequency labeling of a length sufficient to allow incoherent 
magnetization transfer between nuclei which are close together in space. The 
ROE remains positive irrespective of the value of the correlation time and the 
Larmor frequency. ROE cross-peaks appear as peaks with opposite intensity 
with respect to the diagonal peaks in the ROESY spectrum. The experiment 
must be used and interpreted with care, however, since artifacts can easily arise 
due to the presence of Hartmann-Hahn magnetization transfer!®.21,23,25,26, 
Such artifacts can be detected by altering the conditions under which the 
experiment is carried out, for example, by changing the frequency offset of the 
carrier, the field strength or the mixing time of the spin-lock. In the present 
study, Hartmann-Hahn magnetization transfer was only observed for the two 


protons of the terminal NH» group or NH» groups in the side chain. But the 
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appearance of these cross-peaks does not affect the analysis in any way. 

The cross-peaks between Ha(i) and NH(i+1), NH(i) and Ha(i), and NH(i) 
and H(i), which show connectivities between consecutive amino-acid residues 
or within the same residue, were observed for all the peptides. Besides these, 
well-defined ROEs between NH(i) and NH(i+1) were observed only for DERS 
and NEKS, and the patterns were the same as were found for DEKS, except the 
intensities were different. Although not used in any quantitative sense, ROE 
intensities were estimated qualitatively from two-dimensional contour plots and 
relative to the generally strong sequential cross-peaks. 

For DERS and NEKS, for which we observed NH-NH interactions in the 
ROESY spectra, we took particular care that the ROEs reported represent true 
proximity between protons. ROESY experiments were carried out at different 
mixing times (100, 200 and 300 ms), with the transmitter set in the middle of the 
NH region, and at a field strength of 2.0 kHz. This technique has been used for 
peptides from four to 27 amino-acids, and the resulting spectra were basically 
free of any cross-peaks that would = adversely influence the 


interpretation !2,22,29,30, 


Discussion 


The backbone conformations of the tetrapeptides are defined by the 
torsion angles 9, wy and w of each residue, which, in principle, can be derived 
from the Ha-NH coupling constants and the strength of the nuclear Overhauser 
enhancements?1-35, The conformation of side-chains is analyzed in terms of 
side-chain rotamer populations?® for which clearly resolved Ha-HB and Ha-Hp' 
coupling constants are a prerequisite. This analysis is complicated when the 
studied peptide does not possess a well-defined or rigid conformation. In the 


case of tetramers, the peptides possess an unlimited number of conformations 
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and all measurable parameters are therefore population-weighted averages of 
the conformational states available to the peptide, which precludes a numerically 
precise interpretation of the data. A list of accepted criteria for turn formation, 
some of which are considered more definitive than others, can be used, which 
includes: (a) dyni(i, i+ 1) NOE connectivities between residues 3 and 4 of the turn 
(type | turns also have a dyn(2,3) connectivity); (b) a dgn(2,4) NOE 
connectivity; (c) appropriate SJHNa coupling constants (not always seen); (d) a 
low temperature coefficient for the NH of residue 4 of the turn, indicating 
hydrogen bonding (not always seen). Such a hydrogen bond is not a 
requirement for turn formation, at least not in proteins?’; (e) characteristic CD or 
vibrational spectra. 

In our previous study of DEKS'1, four pieces of evidence were clearly 
observed (Table 4.3), i.e. the Sup, coupling constant for E2 was rather low 
(4.8 Hz), the temperature dependence for S4 was quite low (2.3 ppb/K), and 
NOE connectivities between NH(E2) and NH(K3) (weak), NH(K3) and NH(S4) 
(medium) were observed. A CD spectrum showed that DEKS did possess a 
proportion of ordered structure. Based on this evidence, we concluded that 
DEKS formed a type | £-turn in the solution. In the present study, basically the 
same set of data were observed for DERS and NEKS, i.e. a small coupling 
constants for E2 (5.0 Hz) in both cases, a lower temperature dependence for s4 
(2.7 ppb/K for DERS and 3.8 ppb/K for NEKS), ROEs between NH(E?) and 
NH(R3) (weak), NH(R3) and NH(S4) (medium) for DERS, NH(K3) and NH(S4) 
(weak) for NEKS. In view of the NMR data on these two molecules, which are 
summarized in Table 4.3, we believe that NEKS and DERS form a type | f-turn 
in solution, as does DEKS. It must be noted that one piece of evidence leading 
to the detection of a ®-turn, the dgn(2,4) ROE connectivity, was missing in all 


three molecules. The reason for this might be that the population of B-turn in the 
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solution is so low that the experimental method is not sufficiently sensitive for 
this connectivity to be detected, since, in a type | f-turn structure, the dyn(2,4) 
ROE is a much weaker interaction (3.6 A) than the two dNN ROEs (2.6 and 2.4 
A)24. The detection of a sequential d,pyy(2,3) ROE, which is much stronger than 
the dNN(2,3) or dyn(3,4) ROEs, shows that in all three peptides, a smaller 
proportion of molecules in the type | B-turn conformation coexist with a larger 
proportion in extended conformations. A strong dgn(2,3) ROE is only consistent 
with an extended conformation (2.3 A) but cannot arise from the B-turn structure 
(3.4 A). It has to be noted that in the case of small peptides in solution, it is 
impossible to classify all the conformations into two classes only, for example, B- 
turn and extended. This makes it impossible to assess the ratio between the 
molecules in the extended conformation and those adopting a f-turn in a 
numerically precise manner, based only on the NH-Ha coupling constants, the 
temperature coefficients and ROE intensities. The lack of standard values for 
the temperature coefficient of each residue in a certain conformation make it 
even worse. Nevertheless, based on the present NMR data, we conclude that 
approximately the same proportion of type | (j-turn exists in solutions of DEKS 
and DERS. NEKS has a smaller population of B-turn, since the dyyy(2,3) ROE, 
which is weaker than the dyyy(3,4), was almost invisible in the ROESY spectrum 
and the temperature coefficient of S4 was relative large. 

Apparently the charge-charge interaction is dominant with regard to the 
secondary structure, especially in low dielectric constant solvents such as the 
methanol/water mixture22. Salt-bridging has been observed previously in model 
tetrapeptides38 where the charge interaction was observed between moieties 1 
and 4. Recently salt-bridging has been observed to be a stabilizing force in the 
formation of an a-helix8, where the salt-bridge sets the amino terminal boundary 


of the helix. Previously, we postulated’! that a salt-bridge between the side- 
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chains of E and K in DEKS, stabilized the B-turn conformation. For DERS where 
K is replaced by R, the f-turn is still present. By replacing E in DEKS or DERS 
by Q, the f-turn disappeared in DQKS and DQRS. Under low pH conditions, all 
evidence for B-turns disappeared for the three molecules. In the case of NEKS, 
this is significant as there is only one possible charge-charge interaction. These 
results clearly showed the importance of the salt-bridge in stabilizing the f-turn. 
Another interesting result is that the charge on the side-chain of the first-residue 
also stabilized the 6-turn conformation. Replacing D in DEKS by N still maintains 
the 6-turn in NEKS, but the population of the 6-turn drops. Replacing D in DERS 
by N destroys the 6-turn in NERS. All other replacements tested also destroyed 
the B-turn. It seems that a minus/minus/plus charge pattern and the exact 
lengths of the first and second side-chains are both important. These charges 
set up such a delicate balance to stabilize a f-turn that the loss or addition of a 
side-chain methylene group (D1 to E! or E2 to D2) is sufficient to disrupt the p- 
turn conformation as in the case reported earlier39, where an E to D substitution 
disrupted the balance between several charge-charge interactions, resulting in 
partial unfolding of an enzyme. 

It is of interest to compare our results with the secondary structure 
predictions derived using the parameters of Wilmot and Thornton's (Table 4.4). 
Of the sixteen tetrapeptides studied, only five were not predicted to adopt a f- 
turn. Three of the five carried Q in position 2 and therefore could not form the 
salt-bridge. In the other two, D at position 1 was replaced by E. A general 
observation is that this method leads to overprediction of f-turns in the isolated 
tetrapeptides studied here. The three tetrapeptides for which we did find 
evidence by NMR of f-turn formation (DEKS, NEKS and DERS) were all 
predicted to adopt a type | f-turn but were by no means those assigned the 


highest probabilities for doing so. In fact the prediction for DEKS was the 
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weakest of all and DERS and NEKS were ranked seventh and ninth out of 
eleven, respectively. Even more striking was the total lack of evidence for ®-turn 
formation in NPDM, NSDM and NDDS, the three tetrapeptides with by far the 
strongest predictions. It could be argued that the use of parameters derived 
from a protein structure data base for prediction of B-turns in small peptides in 
solution is inappropriate. However the f-turn is generally considered to be a unit 
of secondary structure which derives its stability from local steric factors and 
hydrogen bonding. Consequently we would have anticipated being able to 
detect at least some evidence for f-turns under the structure-promoting, low 
dielectric constant conditions used here. These findings could be reconciled if, 
in proteins the B-turns fall into two classes - those that are intrinsically stable and 
those that depend on long-range interactions (for example between strands of B- 
sheets) for their stability. Validation of this suggestion is clearly outside the 
scope of the present study. 

To study this selectivity further, a series of calculations were carried out 
using the software "Biograf"4°. In the program, the AMBER force field4*’ was 
used. A distance-dependent dielectric constant was used to qualitatively 
simulate the fact that the peptides are in a methanol/water solvent and the 
intramolecular electrostatic interactions should die off more rapidly with distance 
than in the gas phase. But for the system of tetramers, as was expected, a 
simple energy minimization process could not provide any meaningful results to 
explain the experimental results, since the molecules possess too many 
conformations, undergoing rapid interchange between different conformers and 


have many degree of freedom and local energy minima. 
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Table 4.1 Summary of Experimental Parameters Used in the Two- 


Dimensional NMR Experiments@ 


Parameters COSYPH ROESYPHD 
Sweep width in F2 (Hz) 3600 3600 
Sweep width in F1 (Hz) 1800x2 1800x2 


Matrix size (F1xF2) 


before zero-filling 256x1K 256x1K 
Matrix size (F1xF2) 

after zero-filling 512x2K 512x2K 
Evolution time 

Initial value (us) 1 1 

Increment (us) 139 139 
Number of scans 48 64 

(dummy scans) (2) (2) 
Acquisition time (s) O54 0.57 
Relaxation delay (s) 1.8 1.8 
Other delay (ms) -- 300° 
Window functions for 2D FT S/S S/S 

(F1/F2)4 
Shifts of window function in 6/8 3/4 


fractions of m (F1/F2) 
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Some parameters are slightly different, depending on the molecule. 
The ROESY experiment was performed several times with different 
transmitter frequencies and mixing times (see text). 

Spin locking time at an average field strength of 4.0 kHz 


S stands for a sine bell. 
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Table 4.2 Proton Chemical Shifts®, Coupling Constants® and NH 


Temperature Coefficients© of the Peptides Studied4 


Pep. NH a pt yf af € J(Hz) AT 

Res. (ppb/K) 

NEKS 

NAc 2.03 

(2.01)© 

Asni 8.39 466 2.73,2.78  7.66,6.92 7.2 6.4 

(8.35) (4.66)  (2.69,2.80) (7.64,6.92) (7.0) 
(NH5)9 

Glu2 9.07 4.22 2.03 2.28 5.0 4.2 
(8.46) (4.31) — (1.94,2.10) (2.46) (6.6) 

Lys3 8.27 433 > 1:82:1.88 1.47 1.68 2.97 7.3 5.1 
(8.31) (4.33) (1.78) (1.46) (1.69) (2.98) (7.0) 

Ser4 7.96 4.35 3.83,3.88 7.3 3.8 
(8.017) (4.38) (3.81) (7.4) 


NHp = 7.45,7.13 
(7.52,7.13) 
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NAc 2.03 
(2.02)© 

Glu 8.41 4.22 1.91,2.02 pts 6.0 5.6 
(8.30) (4.29)  (1.95,2.09) (2.44) (7.0) 

Glu2 8.78 4.26 1.91,2.02 2.25 6.5 5.3 
(8.44) (4.34) — (1.95,2.09) (2.44) (7.0) 

Lys3 8.32 4.35 126, 1.00 1.43 1.68 2.98 rA@) 6.4 
(8.33) (4.34) (1.75,1.86) (1.44) (1.68) (2.97) (7.5) 

Ser4 8.11 4.36 3.83,3.88 7:5 5.3 
(8.13) (4.39)  (3.82,3.85) (7.5) 


NHp = 7.13,7.52 
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(2.45) (2.00,2.14) 
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Tis 
7.5 
7.5 


7.5 


6.4 


5.5 


4.3 


3.9 


Chemical shifts are reported in ppm relative to the methyl group of 
CD30H (3.30 ppm downfield from TMS ). 

Coupling constants in Hz refer to JNH-Ha and were taken from 1D 
spectra. 

The effect of temperature on the chemical shifts of the 
exchangeable protons is an average value of temperature 
coefficients measured at either eight or four points between 284K 
and 300K, at 400 MHz as discussed in the text. 

The spectra were measured over a pH range of 5.5 to 6.6 which 
is well above the pKa's of D and E with the exception of NQKS 
(pH 5.3). The concentrations studied at this pH were between 3.9 
and 6.8 mmol/L in 60% CD30H / 40% H2O0 at ambient 
temperature. Six tetrapeptides were also measured at low pH, 
ranging from 1.9 to 3.2, well below the pKa's of D and E. Higher 
concentrations (6.8 to 23.1 mmol/L) were used to confirm the 
absence of even small ROE's under these conditions. 

The numbers in brackets are for the samples at low pH. 

The pairs of protons x (x=a, f, y ...) could not be assigned 
stereospecifically. 


The chemical shift for the cis-isomer is 8.402 ppm. 
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Table 4.3. Summary of 1H Chemical Shifts@, Coupling Constants and NH 
Temperature Coefficients of DEKS, DERS and NEKS in 
CD30H/H20 (60/40) 
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a) Chemical shifts in ppm relative to the methyl signal of CD30OH 
(3.30 ppm downfield from TMS). 

b) These data on DEKS were published previously! 1 at 283K, pH 
6.4 at a concentration of 10 mM. The values in brackets were 
obtained at pH 3.5, otherwise the same experimental conditions 


were used. 
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Table 4.4 


a) see reference 13. 


b) Probability in multiples of cut-off probability. 


Probabilities of B-turn Formation in Tetrapeptides@ 


Peptide 
DEKS 
DQKS 
DDKS 
NEKS 
NQKS 
EEKS 
DKES 
DERS 
DQRS 
DDRS 
NDRS 
EERS 
NERS 
NPDM 
NSDM 
NDDS 


C) Below cut-off. 


d) Type | also possible but at lower probability. 


e) Type N also possible but at lower probability. 


Type | 
aC 

Type | 
Type | 


ets 
SC 

Type N 
Type | 
ec 

Type | 
Type | 
mec 

Type | 
Type | 
Type | 
Type | 
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Prediction 


1.0126 
<1.0 
1.094 
1.361 
<1.0 
<1.0 
1.845¢ 
2,.228¢ 
<1.0 
2.410 
3.241€ 
m4 (10) 
2.997 
231397 
17.244 
4.784e 
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Figure 4.1 


NH-NH region of the 500 MHz _ phase-sensitive ROESY 
experiments on DEKS, DERS and NEKS in CD30H/H90 (60/40). 
The spectra were recorded at 20°C. The sample concentration 
was 5 mM, pH 5.8. The spin-locking time was 300 ms at a field 


strength of 5.0 kHz. The cross-peaks are of opposite sign with 


respect to the diagonal. 
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CHAPTER 5 


Proton Magnetic Resonance Studies of Bradykinin 
Antagonists” 


Introduction 

Bradykinin (BK) is a linear peptide hormone (Arg! -Pro2-Pro3-Gly4-Phe9- 
Ser§-Pro’-Phe8-Arg9) and has been implicated in a multitude of 
pathophysiological processes!:*. Of particular significance is its role as a potent 
pain-producing agent. Recently, it has been suggested that bradykinin may be 
associated with the symptoms of the common cold3:4. Because of these 
physiological functions, a bradykinin receptor antagonist may have significant 
therapeutic value, and the synthesis of these antagonists has been the subject of 
intensive research. 

Many bradykinin antagonists have been synthesized with the aim of 
increasing the potency, selectivity and lifetime of these antagonists>©. in almost 
all of these syntheses, analogs of natural amino acids were used to impose 
specific conformational restraints’® in order to obtain insight about their 
bioactive conformation. So far, BK analogs having bulky, beta-branched or cyclic 
D-aliphatic residues at position 7 combined with bulky or cyclic L-aliphatic 
residues at position 8 have yielded the best antagonists9:!9, as have bradykinin 
antagonist dimers!!. 

Earlier conformational studies in solution!?:'13 on bradykinin itself and an 


antagonist led to a hypothesis that the difference between an agonist and an 


* A version of this chapter has been accepted by Biopolymers. Liu, X., Stewart, J. M., Gera, L., 


& Kotovych, G. (1993), Biopolymers, in press. 
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antagonist is related to the type of B-turn adopted at the C-terminus together with 
the required orientation of the side chains'*. To challenge this hypothesis, Kyle 
et al.14.15 designed and prepared several constrained bradykinin analogs, which 
were assumed to have an inherently stabilized f-turn geometry at their C- 
terminus based on conformational analysis using empirical calculations. Two of 
them were found to be competitive antagonists with big bulky side-chains at 
position 7 or 8, such as DArg9-Arg!-Pro2-Hyp3-Gly4-Thi9-Ser8-DTic’-Tic8-Arg9 
(Tic = tetrahydroisoquinoline carboxylic acid residue), but NMR data were not 
reported for them. More recently, two bradykinin analogs containing a-MePro at 
position 3 or position 7 were reported to show reversed-turn conformations at 
both Pro2-Phe? and Ser®-ArgY, respectively, in aqueous solution by NMR, but 
no activity data were reported on these molecules'®, 

In the present study, we investigated by 1H NMR the solution 
conformation of [DArg9, Hyp3, ThiS, DCpg’, Cpg§]-BK (I) (Cpg = a-cyclopentyl- 
glycine; Hyp = trans-4-hydroxy-L-proline, Thi = 6-(2-thienyl)-L-alanine) (Figure 
5.1), an extremely potent bradykinin antagonist and several related antagonists, 
which differ from each other only by one or two residues, in order to provide 
more experimental evidence for the hypothesis that a B-turn in the four C- 
terminal amino acid residues of bradykinin analogs might be a prerequisite for 
their high activities!?. This study was carried out on the antagonists in the free 
state as it is impossible to carry out NMR studies when they are bound to the 
receptor. These antagonists function at nanomolar concentration, far below the 
limit of NMR detection sensitivity. The solvent for this study is a methanol/water 
(80/20 v/v) mixture, which is a structure-inducing solvent that has been used in 
the study of other small peptides'’. The sequences for all the peptides studied 
are listed in Table 5.1 and pharmacological data are listed in Table 5.2. This 1H 


NMR study is important to help clarify the role each residue plays and to 
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understand the geometric requirements for bradykinin antagonists. In a 


separate paper, we present a similar NMR study on two agonist-antagonist 


pairs 18, 


Materials and Methods 
Materials and Sample Preparation 

The peptides were synthesized, purified and assayed in the classical BK 
assay systems as reported previously®. The samples used in all 1H NMR 
experiments were prepared by dissolving purified solid peptides in 160 wl HoO. 
The pH was then adjusted by adding small aliquots of diluted NaOH or HCl, 
followed by adding 640 ul CD30H to make the solvent an 80/20 methanol/water 
mixture. Before the NMR experiments, the samples were degassed with argon. 
The sample concentrations were between 0.9 and 1.4 mmol/L and the pH of the 
samples was in the range of 5.2 to 5.4. Aggregation is minimized by these low 
concentrations, together with the fact that the active peptides contain four 
positive charges. Peptide | was diluted by a factor of ten with no substantial 


changes in chemical shifts, also indicating the absence of aggregation. 


NMR Spectroscopy 
All experiments were carried out on a Varian UNITY 500 NMR 


spectrometer using a proton-selective 5 mm probe with a 90° proton pulse length 
of 8.8 to 10.0 us at normal power levels. During the experiments the 
temperature was always controlled to +0.1°C and set to 0°C except for the 
temperature studies. For FID acquisitions and data processing, VNMR 3.1 and 
3.2 software were used on a SUN 4/330 and a SUN 4/470 workstation, 
respectively. The proton chemical shifts were measured relative to the 


undeuterated fraction of the methyl group of CD3OH (Merck, Sharp & Dome, 
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MD-67, 99.9% D) at 3.30 ppm. All 2D Spectra were acquired non-spinning, 
whereas for 1D experiments the spinner was turned on. 

In all NMR experiments, the water resonance from the solvent was 
suppressed during the relaxation delay by presaturation. The decoupler offset 
was set equal to the transmitter offset to prevent phase distortions around the 
water peak due to imperfect cancellation of the dispersive component of residual 
water magnetization'9. Before each experiment, the delays before and after the 
reading pulse (alfa delay) were carefully adjusted to give a flat and undistorted 
baseline29, 

One-dimensional proton spectra were recorded with a sweep width of 
SO00HZ, a relaxation time of 2s and 16K (zero-filled to 32K) data points. Usually 
a total of 32 scans were accumulated, and the resulting FID's were resolution 
enhanced by means of a Gaussian multiplication with a line broadening factor of 
typically -3 Hz. 

The temperature dependence of the amide protons was calculated by 
measuring the chemical shifts at -5, 0, 5, 10, 15, 20 and 25 °C for all peptides 
and then analyzed by means of a linear regression. The controller temperatures 
were calibrated using a methanol sample provided by Varian@! and were found 
to be in the range of +/-1.0 °C of the predicted temperature. 

Two-dimensional experiments were measured with 2048 (zero filled to 
4096) or 4096 data points along to and 256 ty increments. Spectral widths of 
5000 Hz in both dimensions and a relaxation delay of 1.5 to 2.0 s were 
employed. All two-dimensional experiments were acquired in the phase- 
sensitive mode by using the hypercomplex technique known as the States- 
Haberkorn-Ruben method22. The TOCSY®% experiments were carried out by 
using the basic pulse sequence proposed by Bax24,. A 2ms trim pulse preceded 


the 70-80ms MLEV-17 spin-lock (field strength 7.0 kHz). 16 to 32 scans were 
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accumulated per experiment. A n/2 shifted, squared sine bell weighting function 
was used in both dimensions. The ROESY25.26 experiments were recorded with 
the pulse sequence 90°-t1-90°-SL-90°-FID. Spin-lock conditions (SL) were 
achieved by a series of hard pulses of 30° flip angle?” and the 90° pulses 
immediately preceding and following the spin locking period were added to 
compensate for the non-negligible offset dependence of the cross-peaks 
(compensated ROESY)?8. The experiments were carried out with mixing times 
of 100 to 200 ms, spin lock field strength of 5.0 KHz and 32 scans per FID. In all 
cases, n/2 shifted, squared sine bell weighting functions were used. The 
NOESY experiments@2:3°0 were carried out with the 90°-t1-90°-mix-90°-FID pulse 
sequence, 200 or 300 ms mixing time and 32 scans per t1 increment. Water 
suppression was achieved by decoupler saturation at all times except during the 
acquisition period. A 10% random delay was incorporated in the mixing delay to 
suppress zero quantum coherences. For TOCSY, ROESY and NOESY 
experiments, the compensation for first point distortion caused by analog filters?! 
was achieved by reducing the first point of each FID and each t1-interferogram 
by empirically determined values®°. For all the 2D spectra, baseline corrections 


were not necessary. 


Results 


Chemical Shift Assignments and Coupling Constants 


The phase-sensitive TOCSY spectra of the six peptides in methanol/water 
(80/20) alone allowed the identification of all unique amino acids in a 
sequence23, Then, a sequential analysis was performed using the NOESY or 
ROESY spectra?2 to achieve the complete assignment. These assignments rely 
on Hg-NH(i, i+1), NH-NH(i, i+1) and for Pro residues, Ha-Hs aii, i+1) 


connectivities. The NH resonance of the first N-terminal residue is always 
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absent due to fast exchange with the solvent since the peptides are not 
protected. In addition, the NH resonances of Arg! for peptides | to III are usually 
very weak and can only be seen at low temperature. However, the spin systems 
for these Arg resonances can be easily identified through their NH resonance in 
the side chain. It was difficult to assign specifically each of the aliphatic ring 
protons in the Cpg residues because of spectral overlap. At best, the four 
protons at positions 2 and 5 can be assigned, but the protons at positions 3 & 4 
in each ring cannot be resolved. The cis isomers due to the X-Pro or X-Hyp 
peptide bond are not observed. This observation is consistent with the 
suggestion based on NMR experiments run in a lipophilic environment that all 
amide bonds remain trans in both agonists and antagonists!9. 

The assigned resonances for all the peptides are summarized in Table 
5.3. As an example of the experimental data, the relevant section of the TOCSY 
contour plot for peptide I, recorded with a mixing time of 80 ms is shown in 
Figure 5.2, whereas the NH/NH section of its NOESY spectrum recorded with a 
mixing time of 300 ms is depicted in Figure 5.3. 

Peptides | and II exhibit many similarities in terms of chemical shifts, 
which is understandable since the peptides only differ by one residue (Thid in Lis 
replaced by Phe? in Il). From peptide | to peptide Il, very little change in the 
chemical shifts of the amide protons was observed. In these two peptides, the 
NH resonances for Gly4 are to low field and the two a-protons are well 
separated, indicating less flexibility in this region of the molecules. Similar 
behavior for Gly4 was observed for peptide Ill. However, it is interesting to notice 
that the chemical shift for some NH protons changes when one or two residues 
in a peptide are changed. When DCpg’ in peptide Il is replaced by Cpg’, 
resulting in peptide Ill, the chemical shifts for the NH protons of Cpg8 shift to 
high field (8.47 to 8.17 from II to Ill). The chemical shifts of the other NH 
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resonances for the neighboring residues were also changed, but to a lesser 
extent. 

All vicinal coupling constants of the type 3J,,.n}4 could be measured in the 
one-dimensional spectra, at different temperatures as required to minimize 
spectral overlap. None of the coupling constants changed by more than 0.3 Hz 
over the temperature range -5 to 25°C, an indication that the conformation does 
not change significantly. All the observed SJynH values are close to the time- 
averaged value of 6.3 Hz for free rotation about the N-C, bond. The expected 
value for residues at positions i+1 and i+2 for ®-turns in proteins are predicted to 
be approximately 4 and 9 Hz, respectively22, but because of the presence of 
Pro? and Hyp3 (or Pro3), residues for which coupling constants are not 
available, no firm conclusions about preferred conformations can be drawn from 


these data. 


Temperature Dependence of NH-chemical Shifts 


An amide proton involved in a stable intramolecular hydrogen bond, or 
one inaccessible to solvent for steric reasons typically shows a reduced 
temperature coefficient in the range 0 to -3 ppb/K%3. Although such low 
temperature coefficients do not necessarily mean the presence of a folded 
structure, they are important indicators of secondary structure when combined 
with NOE data, especially in the case of small peptides. The temperature 
coefficients for the amide protons of peptides | to IV are listed in Table 5.3. The 
important data for peptides | and Il are summarized in Figure 5.4. In all cases, 
the chemical shifts of the amide protons varied linearly with temperature with 
correlation coefficients better than 0.988, indicating that major conformational 
changes are not present in the temperature range under study. In peptides | and 


Il, which behave similarly in terms of temperature coefficients, Thi? and Phe? 
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displayed low temperature coefficients of -2.4 and -2.9, respectively, which 
suggest that these protons might be involved in a turn. In comparing peptides | 
and Il with peptide Ill, the low temperature coefficients at position 5 disappeared. 

It is interesting to compare the temperature coefficients of Phe? or Thi 
and Arg9 in all the peptides studied, which are summarized in Table 5.2. A 
general trend can be seen, namely that the two active antagonists (I & Il) have a 
low value at position 5 and high value at position 9. Less active antagonists 
have a lower value at position 9 and a higher one at position 5 (peptides III, IV). 
In the case of agonists, the NH of Arg9 was involved in an H-bond and displayed 


the lowest temperature coefficients in the peptides'8, 


NOE Information 

Both ROESY and NOESY experiments were carried out for all the 
peptides. In all the NOESY experiments, the cross-peaks have the same sign 
with respect to the diagonal peak, which means that the NOEs are negative, 
probably because the experiments were carried out at O°C. These two 
experiments showed similar results, but with different NOE intensities. For 
peptides I and Il, the NOESY experiments showed stronger NOEs than the 
ROESY experiments, thus revealing more information. For peptide IV, NOESY 
and ROESY were about the same. For peptide Ill, the ROESY experiment 
appeared to be better than the NOESY. For peptide I, the ROESY experiments 
were carried out with different mixing times and different strengths of the spin- 
lock to detect any artifacts34.35. NOESY experiments were run with different 
mixing times as well. For peptides | and Il, the cross-peak volumes of the 
NOESY at a mixing time of 300 ms were integrated with a correction for t1 or t2 
ridges and for slight phase imperfection by measuring both cross-peaks. Where 


applicable, the volumes were normalized for the contributions of more than two 
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protons to the cross-peak by a factor of (2NaNp)/(Na+Np), where Na and Np are 
the number of protons responsible for the diagonal peaks a and b, 
respectively36, The results based on these integrals for peptides | and Il are 
summarized in Figure 5.4. 

For each peptide, a set of expected, consecutive, strong dgn(i, i+1) and 
weak dynii, i+1) were identified in the ROESY or the NOESY spectra. 
Numerous other ROEs or NOEs were observed between protons in neighboring 
residues. Figure 5.3 shows the NH/NH region of a 500 MHz phase-sensitive 
NOESY experiment of peptide | at a mixing time of 300ms. As can be seen, the 
NHs of residue 5 and 6 overlap at 0°C and have a similar temperature coefficient 
for peptides I-Ill, making it impossible to decide whether cross-peaks exist 
between those two NHs as they are too close to the diagonal. 

In addition, some medium range NOEs were identified in peptides | and Il. 
In peptide Il, they were between the NH of Phe° and each of the following 
protons: Hg, (Hyp3), Hg(Pro2). In peptide |, they were between Hg of Pro? and 
NH of Thi?. The NOE between H, of Hyp3 and NH of Thi is not observable 
because of overlap of the Hq protons of Thi? and Hyps. These medium range 
NOEs are necessary to prove a turn-like structure. Extra NOE information 
cannot be extracted from the data on peptide III since the NHs of residues 5 to 8 
overlap. For peptide IV, in addition to the usually observed intra- and inter- 
residue ROEs associated with an extended conformation, some unusual ROE 
patterns were observed around residues 7 and 8. A very strong NH-Hp(Phe®) 
cross-peak was found but no NH/Hg(Phe8) and also no NH-H,, (Phe8) 
interactions were evident. Furthermore, ROEs between Hy(DCpg’) at 0.72 ppm 
and NH(Phe8) as well as between Hyy\(DCpg’) at 0.72/0.91 ppm and the 


aromatic ring protons of Phe® were observed, indicating rigidity within this region 


of the molecule. 
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Discussion 

The conformations of peptides are defined by a set of torsion angles for 
each residue which, in principle, can be derived from coupling constants, the 
strength of the nuclear Overhauser enhancements and amide proton exchange 
rate and/or temperature coefficients. Unfortunately, these general rules can only 
be applied when the studied peptide possesses a relatively well-defined or rigid 
conformation. In the case of small peptides like the bradykinin antagonists, the 
peptides possess a high number of conformations and all measurable 
parameters are therefore population-weighted averages of the conformational 
states available to the peptide, which precludes a numerically precise 
interpretation of the data. But a list of accepted criteria for turn formation, some 
of which are considered more definitive than others, can be used37, which 
include: (a) dyn(3,4) NOE connectivities (type | turns also have a dyn(2,3) 
connectivity); (b) a dgn(2,4) NOE connectivity; (c) appropriate SJHNo coupling 
constants (not always seen); (d) a low temperature coefficient for the NH of 
residue 4 of the turn, indicating hydrogen bonding. Although such a hydrogen 
bond is not a requirement for turn formation, at least not in proteins, it is a very 
important stabilizing factor in small peptides; and (e) characteristic CD or 
vibrational spectra. In light of these criteria, it is possible to draw only a 
qualitative conclusion about the presence of a turn in the peptides. 

Two highly active bradykinin antagonists, peptides | & II, which differ from 
each other at position 5 (Thi? vs. Phe°), show almost identical NMR spectra in 
terms of chemical shifts, temperature coefficients and NOE information. Based 
on their sequential NOEs, extended conformations are dominant in solution for 
both molecules. A series of NH(i)-NH(i+1) NOEs, which were observed with a 


similar intensity, except between DCpg’ and Cpg8, are compatible with the 
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extended conformation. The relatively weaker NH/NH NOE between DCpg/ and 
Cpg8 may result partially from restrictions caused by the two bulky aliphatic side 
chains. The extended conformation for the C-terminal end is thus influenced by 
the bulky side-chains at DCpg’ and Cpg8. However, observed medium-range 
NOEs, i.e. Hg (Hyp3) with NH(Phe°), Hg (Pro?) with NH(Phe®) in peptide Il, 
Hg (Pro@) with NH(Thi°) in peptide I, are not compatible with a fully extended 
conformation. These weak NOEs can only be explained by a reverse turn for 
this section of the peptides. This conclusion is supported by the low temperature 
coefficient of Phe? and Thi9, respectively, indicating that their amide protons are 
involved in a hydrogen bond. Although one of the important criteria, namely the 
NOE between H,,(Hyp3) and NH(Thi9) in peptide | is unobservable because of 
the overlap of the Ha protons of Thi? and Hyp4, this is observed in peptide Il and 
hence can be inferred for peptide | because of the similarity between peptides | 
& Il. 

It is not surprising to find evidence for the turn-like structure between 
residues 2 and 5 in peptides | and Il. According to Wilmot & Thornton's 
prediction38, Pro in position (i+1) is the most strongly preferred amino acid for 
both type | & Il B-turns, because the proline-inherent restriction of o to 
approximately -60° satisfies the o(i+1) requirement for both types. On the basis 
of the Chou and Fasman prediction29, the sequence Pro?-Pro3-Gly4-Phe® has a 
significant B-turn probability. Our results are consistent with a previous study of 
an antagonist!2 and of the a-methylproline analog'® of bradykinin where a turn- 
like structure was detected from position 2 to 5. As in the previous studies!@.16, 
it is difficult to determine which type of turn is adopted by those four residues. 
The differences between the type | and the type II B-turns in terms of NMR 
information are a lack of NH(i+1)/NH(i+2) NOEs and stronger Ha(i+1)/NH(i+2) 


NOE in the type II B-turn32. Both pieces of information are unfortunately missing 
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because residue (i+1) is Hyp? and the domination by the extended conformation. 
It is more surprising that, contrary to the early hypothesis, in which turn-like 
structure between Ser® and Arg9 were predicted to be important for activities of 
antagonists!2, residues between Ser® and Arg? are not involved in a f-turn as 
evidenced by the absence of both medium-range NOEs and low amide 
temperature coefficients for the C-terminal residues, although molecular 
modeling calculations showed that the bulky aliphatic side-chains at positions 7 
& 8 have an inherently stabilized B-turn geometry!4:15. 

The less active bradykinin antagonist, i.e. peptide Ill showed no evidence 
to support the turn-like structure between residues 2 and 5. However, it seems 
there was some rigidity around Gly4 since its a-protons exhibit a large chemical 
shift difference as observed in peptides | & Il. Replacing DCpg/ in peptide II 
with Cpg/ to form peptide Ill makes peptide III less structured, which shows the 
importance of the D-amino acid at position 7. In peptide Ill, the NHs for residues 
5 to 8 exhibited NH temperature coefficients that were uniformly similar at about 
-5.0 ppb/K. Compared to Cpg8 in peptide |, the NH of Cpg® in peptide III shifted 
to high field by 0.3 ppm probably due to shielding from two aliphatic rings. For 
peptides IV (very weak antagonist), no structure other than extended can be 
defined, except NOEs showed some rigidity for residues DCpg’ and Phe8 (see 
results). 

Based on this study, the correlation between the existence of a turn-like 
structure from residues 2 to 5 with antagonist activity is found. The turn-like 
structures are based on the combination of the slow NH temperature coefficients 
at position 5 (Table 5.2), the NH/NH NOEs between residues 4 & 5 and a 
medium range Ha(i)-NH(i+2) (Figure 5.4). This is contrary to the early prediction 
in which the nature of the B-turn between residue 6 and residue 9 was thought to 


be responsible for changing the activity from an agonist to an antagonist'*. We 
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think the NH temperature coefficients are good indicators for turn-like structures, 
since the stabilizing effect of a hydrogen bond is important for the turn formation 
in peptides. As can be seen for peptides | to IV, the NH temperature coefficients 
at position 5 increase, while at position 9 they decrease with a decrease in 
activity. This indicates that a turn becomes less likely between residues 2 & 5. 
In a similar study on a bradykinin agonist-antagonist pair (the agonist DArg9- 
Arg!-Pro?-Hyp3-Gly4-Thi9-Ser8-Pro’-Thi8-Arg2 and the corresponding 
antagonist with DPhe in position 7), a type | B-turn was found between residues 
6 and 9 in the agonists and some indication for a turn between residues 2 & 5 
was found for the antagonist!8. For this agonist, the NH of Arg9 was involved in 
an H-bond and displayed the lowest temperature coefficients in the peptide. All 
of these data suggest that the conformational difference between bradykinin 
agonists and antagonists are due to a turn formed at different positions in the 


peptides. 
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Table 5.1. Peptides Studied. 


| DArg9-Arg!-Pro?-Hyp3-Gly4-Thi9-Ser8-DCpg?-Cpg8-Arg? 
1 DArg9-Arg!-Pro?-Hyp3-Gly4-Phe9-Ser8-DCpg’-Cpg8-Arg? 
i DArg9-Arg!-Pro?-Hyp3-Gly4-Phe9-Ser6-Cpg’-Cpg8-Arg? 
IV Arg! -Pro2-Pro3-Gly4-Phe9-Ser8-DCpg’-Phe8-Arg9 


Table 5.2. Pharmacological Data and NH Temperature Dependence 


Coefficients at Positions 5 and 9 of Peptides I-IV. 


Peptides pAo(Rat pAo(Guinea NH Coeff. at NH Coeff. at 
uterus) pig ileum) Position 5 Position 9 
(-ppb/K) (-ppb/K) 
| re 6.6 2.6 Ti Te 
lI 7.9 Tal 2.9 Ted? 
II oH a5! 4.4 ons 
IV inactive | Sh, 4.2 


pAo = negative logarithm of molar concentration of antagonist to give 50% 
inhibition. 


| = weak inhibitor. 
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Table 5.3 Proton Chemical Shifts@, Coupling Constants? and NH 
Temperature Coefficients© of the Peptides Studiedd 


Peptide I. 

Peptide NH he 6 y other protons —J(Hz) AT 

Residue (-ppb/K) 

DArg® : 3.88 1.85,1.85 1.63,1.63 5:3.20,3.20 . : 
eNH:7.78 

Arg! 9.00 4.55 1.76,1.76 1.68,1.64 5:3.12,3.06 : : 
€NH:7.50 

Pro2 . 4.68  2.28,2.28 1.93,1.86 5:3.84,3.41 : 

Hyp! : 452 2262.02 456 8:3.79,3.79 : : 

Gly4 8.82  4.05,3.82 5.5/6.0 7.4 

Thi?g 8,30 4.52 3.38,3.26 6.94(3',4') 7.5 2.6 

7.26(5') 
Ser6 8.29 4.43  3.81,3.75 7.0 4.1 
DCpg’"_ 8.09 4.24 2.26 1.74,1.64,1.54 8.0 7.1 
1.26 

Cpg® 8.47 4.14 2.28 1.78,1.70,1.64,1.53 8.0 7.6 
1.35,1.24 

Arg? 7.92 413  1.85,1.85 1.69,1.60 §:3.18,3.18 7.5 ZiT 
€NH:7.44 
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Peptide Il. 


Peptide NH a B y other protons _—_J(Hz) AT 

Residue (-ppb/K) 

DArg® : 3.90 1.86,1.86 1.62,1.62 §:3.20,3.20 : : 
eNH:7.75 

Arg! 8.99 4.54 1.70,1.76 1.62,1.62 5:3.08,2.99 : . 
e€NH:7.49 

Pro : As70 #0) 2.00.2 409 1.97, 1.90 §:3.83,3.44 : - 

Hyp? - 453 2.25200 4.56 5:3.80,3.80 : : 

Gly4 8.76  3.99,3.79 5.8/5.8 7.5 

Phe> 8.26 4.49  3.14,3.03 7.23(2,6,4) 5.9 2.9 
7.28(3,5) 

Ser6 8.27 4.43  3.80,3.75 7.5 3.4 

DCpg’ _— 8.03 4.24 2.26 1.75,1.65,1.55 8.1 6.3 
1.28,1.25 

Cpg® 8.47 4.14 2.28 1.79,1.70,1.64,1.55, 7.6 7.5 
1.36,1.26 

Arg? 7.91 4.14  1.85,1.70 1.60,1.60 6:3.17,3.17 7.8 ray 
eNH:7.43 
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Peptide Ill. 


Peptide NH 1 p y other protons J(Hz) AT 

Residue (-ppb/K) 

DArg® . 3.90 1.86,1.86 1.64,1.64 6:3.21,3.21 : : 
eNH:7.62 

Arg! 8.96 4.57 1.69,1.64 1.78,1.78 §:3.12,3.12 : : 
eNH:7.48 

Pro@ : 4.71 2.32,2.32 1.97,1.91 5:3.85,3.48 : ; 

Hyp? : 4.51 2.27,2.00 4.57 5:3.80,3.80 : . 

Gly4 8.73  4.02,3.72 5.5/6.0 rie 

Phe? 8.19 4.59  3.15,3.00 7.23(2,6,4) 6.0! 4.4) 
7.28(3,5) 

Ser6 8.21 4.40 —3.80,3.75 5.5! 4.7) 

Cpg’ 8.18 4.14 2.24 1.77,1.65,1.55 7.5! 5.9) 
1.33,1.26 

Cpg® 8.17 4.10 2.24 1.77,1.65,1.55 6.5! 4.9) 
1.33,1.26 

Arg? 7.86 4.18  1.84,1.84 1.70,1.58 pacieyaW vale 7.5 5.3 
eNH:7.46 
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Peptide IV. 


Peptide 
Resk 
Arg! 


Pro? 
Pro? 


Gly4 


Phe? 
Ser® 
DCpg’ 


NH 


8.79 
7.78 


4.28 


4.67 
4.18 


1..89,1..89 


2.38,2.38 
2.05,2.00 


3.14,2.99 
3.79,3.71 
2.01 


S.3/;2.1 b 
1.895176 


y 


1.69, 1.69 


2.00,1.93 
1.85,1.85 


elie) 
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other protons 


6:3.14,3.14; 
€NH:7.53 

6:3.73,3.49 

5:3.83,3.66 


ring:7.24 


1.07,1.55,1.43,1. 


30 


0.72,0.91,1.47! 


ring:7.27 
6:3.18,3.18 
€NH:7.46 


J(Hz) 


AT 
(-ppb/K) 


8.9 
4.2 


| | w+ 
op f aR’ G 
co.) OO.S. S 
s cb - 


a6 8,688 aa8 


POSE E Pub oe yas 
FE eY 

ca eS.6 
8b 2 


Chemical shifts are reported in ppm relative to the methyl group of 
CD30H at 3.30 ppm downfield from TMS. 

Coupling constants (in Hz) refer to JjH-q and were measured from one- 
dimensional spectra unless indicated otherwise. 

The temperature coefficients of the amide protons are average values of 
temperature coefficients measured at seven points between -5 to 25 °C 
as discussed in the text. 

The data were recorded at 500 MHz in CD3OH/H90 (80/20 v/v%) at 0 °C 
and at a pH of 5.2 to 5.4. The concentrations for the samples were 
between 0.9 and 1.4 mmol/L. 

Pairs of geminal protons x (x=a, f, y ...) are not assigned 
stereospecifically. When two distinct lines were observed, the larger 
chemical shift was arbitrarily assigned to x and the smaller to x’. 

Hyp refers to trans-4-hydroxy-L-proline. 

Thi refers to B-(2-thienyl)-L-alanine. 

Cpg refers to a-cyclopentyl-glycine. The protons of the aliphatic ring were 
not assigned individually, unless otherwise indicated. 

These values were taken from a 1D spectrum at 25°C. Because of 
spectral overlap, the error limits may be as large as + 1 Hz. 

These values are based on two TOCSY experiments at 0°C and 25°C. 
Additional weak peaks (~12%) were observed for this peptide. 

The assignment for the aliphatic ring protons are: 1CH(2.01), 


2,5CH9(0.72/0.91, 1.07/1.55), 3,4CH2(1.47/1.47, 1.43/1.30). 
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Figure 5.1. The primary structure of peptide I. 
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Figure 5.2 The contour plot of a phase-sensitive 500 MHz TOCSY experiment 
of peptide | in CD30OH/H20(80/20). The concentration was 1.4 
mM, pH 5.4 and temperature 0°C. A spin-locking time of 80 ms 


was used. Experimental details are discussed in the text. 
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Figure 5.3 The NH/NH region of a 500 MHz phase-sensitive NOESY 
experiment of peptide | at a mixing time of 300 ms. The 


experimental conditions are as outlined in Figure 5.2. 
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Figure 5.4 


NOEs DArg® Arg! Pro* Hyp? Gly* Thi®Ser®°DCpg’ Cpg® Arg? 
Ho(i)-NH(i+1) Ee ae See 
Ha(i)-Hg'(i+1) React arse 


Fia(i)-NH(i+1) sas pega 


— 
NH(i)-NH(i+1) nee ©: =: ___aeee 
H.,(i)-NH(i+2) 

AS/AT(NH) oa 


DArg°Arg' Pro* Hyp*Gly* Phe*Ser®DCpg’ Cpg® Arg? 
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The amino acid sequences and the NOE summary of peptides | 
and Il. The observed sequential and medium range NOEs are 
indicated schematically by solid bars. The height of the bars 
reflects the NOE intensities based on integration. The temperature 
coefficients of the amide protons are shown with a solid bar for 
values between 0 and -3.0 ppb/K. NOEs shown with dashed lines 
might be present but were not observable due to almost identical 


chemical shifts of the amide protons or spectral overlap. 
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CHAPTER 6 


The Aggregation Properties of Some Bradykinin 
Analogs” 


Introduction 

Bradykinin (BK) is a linear peptide hormone (Arg!-Pro2-Pro3-Gly4-Phe9- 
Ser8-Pro’-Phe8-Arg9) and has been implicated in a multitude of 
pathophysiological processes!:. Of particular significance is its role as a potent 
pain-producing agent. Recently, it has been suggested that bradykinin may be 
associated with the symptoms of septic shock, asthma and the common cold34. 
Because of these physiological functions, a bradykinin receptor antagonist may 
have significant therapeutic value, and the synthesis of these antagonists has 
been the subject of intensive research. 

Many bradykinin antagonists have been synthesized with the aim of 
increasing the potency, selectivity and lifetime of these antagonists®.® In almost 
all of these syntheses, analogs of natural amino acids were used to impose 
specific conformational restraints’8 in order to obtain insight about their 
bioactive conformation. So far, bulky, beta-branched or cyclic D-aliphatic 
residues at position 7 combined with bulky or cyclic L-aliphatic residues at 
position 8 have yielded the best antagonists? 1°. 

Conformational studies in solution of a series of bradykinin agonists, 
antagonists and analogs have been carried out. Our results led to a hypothesis 


that the difference between an agonist and an antagonist is due to a turn formed 


* A version of this chapter has been accepted by J. Biomol. Struct. Dyn. Liu, X., Stewart, J. 


M., Gera, L., & Kotovych, G. (1993), J. Biomol. Struct. Dyn., in press. 
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at different positions in the peptides''12, For antagonists, a turn between 
residues 2 and 5 is important, while for agonists a type | B-turn between residues 
6 and 9 is important. In the course of these Studies, an interesting abnormal 
temperature dependence coefficient was also observed for two of the bradykinin 
analogs and was attributed to aggregation. In the present study, this 
phenomenon was investigated by 1H NMR and CD for the peptides DArg-[Hyp3, 
Thi>, DSer®, DCpg’, Cpg8]-BK [I] and DArg-[Hyp3, DSer®, DCpg7, Cpg8]-BK [Il] 
(Cpg = a-cyclopentylglycine; Hyp = 4-hydroxy-L-proline, Thi = B-(2-thienyl)-L- 
alanine). The complete sequences for these two molecules are listed in Table 
6.1, together with several related analogs, which differ only by one or two 
residue(s) from these two and have been investigated previously by 1H NMR12 
(X. Liu, J. M. Stewart & G. Kotovych, unpublished data). The solvent for this 
Study is a methanol/water (80/20 v/v) mixture, which is a structure-inducing 


solvent that has been used in the study of other small peptides !3,14. 


Materials and Methods 
Materials and Sample Preparation 

The peptides were synthesized, purified and assayed in the classical BK 
assay systems as reported previously®. The samples used in all 1H NMR 
experiments were prepared by dissolving purified solid peptides in 160 ul HoO. 
The pH was then adjusted by adding small aliquots of diluted NaOH or HCl, 
followed by adding 640 ul CD30H to make the solvent an 80/20 methanol/water 
mixture. Before the NMR experiments, the samples were degassed with argon. 
The sample concentrations were 3.2 mM and 1.1 mM, the pH of the samples 
was 5.6 and 5.4 for peptides | and Il, respectively. Peptide | was diluted by a 


factor of ten with some small changes in chemical shifts. 
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NMR Spectroscopy 

All experiments were carried out on the Varian UNITY 500 NMR 
Spectrometer using a proton-selective 5 mm probe with a 90° proton pulse length 
of 8.8 to 10.0 us at normal power levels. During the experiments the 
temperature was always controlled to +0.1°C and set to 0°C except for the 
temperature studies. For FID acquisitions and data processing, VNMR 3.1 and 
3.2 software were used on a SUN 4/330 and a SUN 4/470 workstation, 
respectively. The proton chemical shifts were measured relative to the 
undeuterated fraction of the methyl group of CD3OH (Merck, Sharp & Dome, 
MD-67, 99.9% D) at 3.30 ppm. All 2D spectra were acquired non-spinning, 
whereas for 1D experiments the spinner was turned on. 

In all NMR experiments, the water resonance from the solvent was 
suppressed during the relaxation delay by presaturation. The decoupler offset 
was set equal to the transmitter offset to prevent phase distortions around the 
water peak due to imperfect cancellation of the dispersive component of residual 
water magnetization’. Before each experiment, the delays before and after the 
reading pulse (alfa delay) were carefully adjusted to give a flat and undistorted 
baseline!®, 

One-dimensional proton spectra were recorded with a sweep width of 
5000Hz, a relaxation time of 2s and 16K (zero-filled to 32K) data points. Usually 
a total of 32 scans was accumulated, and the resulting FID's were resolution 
enhanced by means of a Gaussian multiplication with a line broadening factor of 
typically -3 Hz. 

The temperature dependence of the amide protons was calculated by 
measuring the chemical shifts at -5, 0, 5, 10, 15, 20 and 25 °C for peptide | and 
at 10 temperatures between -5 and 40 °C for peptide Il, then analyzed by means 


of a linear regression. The controller temperatures were calibrated using a 
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methanol sample provided by Varian'7 and were found to be in the range of +/- 
1.0 °C of the predicted temperature. 

Two-dimensional experiments were measured with 2048 (zero filled to 
4096 ) or 4096 data points along to and 256 ty increments. Spectral widths of 
9000 Hz in both dimensions and a relaxation delay of 1.5 to 2.0 s were 
employed. All two-dimensional experiments were acquired in the phase- 
Sensitive mode by using the hypercomplex technique known as the States- 
Haberkorn-Ruben method'®. The TOCSY19 experiments were carried out by 
using the basic pulse sequence proposed by Bax@9. A 2ms trim pulse preceded 
the 70-80ms MLEV-17 spin-lock (field strength 7.0 kHz). 16 to 32 scans were 
accumulated per experiment. The NOESY experiments21!:22 were carried out 
with the 90°-t1-90°-mix-90°-FID pulse sequence, 200 or 300 ms mixing time and 
32 scans per t1 increment. Water suppression was achieved by decoupler 
saturation at all times except during the acquisition period. A 10% random delay 
was incorporated in the mixing delay to suppress zero quantum coherences. 
For TOCSY, and NOESY experiments, the compensation for first point distortion 
caused by analog filters?3 was achieved by reducing the first point of each FID 
and each t1-interferogram by empirically determined values!®. In all cases, a 
n/2 shifted, squared sine bell weighting function was used in both dimensions. 
For all the 2D spectra, baseline corrections were not necessary. 

For the deuterium exchange experiments, peptides | & Il were freeze- 
dried and the deuterium solvent was added into the NMR tube just before the 
tube was placed into the magnet. Normal one-D NMR spectra were recorded 


regularly until all the NH resonances disappeared. 
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Circular Dichroism 


The CD spectra (Figure 6.3) were recorded at 10+0.05 °C for peptide | 
and B-(2-thienyl)-L-alanine (27°C for $-(2-thienyl)-L-alanine methyl-amide) on a 
Cary model 60 spectropolarimeter with a model 6001 circular dichroism 
attachment with thermostated cell holder. The instrument had been modified to 
eliminate possible artifactual signals on passing through intense absorption 
bands. Slits were programmed for 1.5 nm band width at each wavelength. The 
instrument was calibrated with dextro-10-camphor-sulfonic acid using the 
procedure suggested by Chen and Yang24. The path length, 0.02 cm to 2 cm 
depending upon the concentration and the wavelength range, was dictated by 
the absorbance of the solution. Solutions were prepared by dissolving weighed 
quantities of peptide I in 80/20 methanol/water, and their concentrations 
calculated by using the peptide percentage and the molecular weight of the 
peptide. Mean residue ellipticities, [6] mrw (deg cm2 dmol-!), were calculated in 
the usual fashion using the mean residue weight, 125, reckoned from the 
peptide's molecular weight. Each spectrum is the average of two rescans. UV 
absorption spectra were recorded at 27°C on a Beckman model 25 


spectrophotometer. 


Results and Discussion 
Chemical Shift Assignments and Coupling Constants 

The phase-sensitive TOCSY spectra of the two peptides in 
methanol/water (80/20) alone allowed the identification of all unique amino acids 
in a sequence@s. Then, a sequential analysis was performed using the 
NOESY25 to achieve the complete assignment. These assignments rely on Hg- 
NH(i, i+1), NH-NH(i, i+1) and for Pro residues, Ha-Ho,a'(i, +1) connectivities. 


The NH resonance of the first N-terminal residue is always absent due to fast 
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exchange with the solvent since the peptides are not protected. In addition, 

NH resonances of Arg! for peptides are usually very weak and can only be seen 
at low temperature. However, the spin systems for these Arg resonances can be 
easily identified through their NH resonance in the side chain. It was difficult to 
assign specifically each of the aliphatic ring protons in the Cpg residues because 
of spectral overlap. The cis isomers due to the X-Pro or X-Hyp peptide bond do 
exist, but in very small amounts (less than 10%). This observation is consistent 
with the suggestion based on NMR experiments run in a lipophilic environment 
that all amide bonds remain trans in both agonists and antagonists2®. 

The assigned resonances for two peptides are summarized in Table 6.2. 
In order to show the effect of a DSer® residue on the peptide NH chemical shifts, 
the chemical shifts for the NHs and the a-protons for the L-Ser® analogs are also 
listed in the table?2. 

Peptides | and Il exhibit many similarities in terms of chemical shifts, 
which is understandable since the peptides only differ by one residue (Thi? in | is 
replaced by Phe? in Il), as do the two L-Ser§ analogs. From peptide | to peptide 
Il, very little change in the chemical shifts of the amide protons was observed. 
However, it is interesting to notice that the chemical shift for some NH protons 
changes when DSer® in peptides | and Il is replaced by L-Ser®. The chemical 
shifts for the NH protons of Cpg8 shift to low field (7.86 to 8.47 for I, 7.77 to 8.47 
for Il). The chemical shifts of the other NH resonances for the neighboring 
residues were also changed, but to a lesser extent. Although changes in 
chemical shifts are not necessarily a good probe for conformational changes in 
peptides, they are a sensitive indicator for the changes in the chemical 
environment which these NH protons experience. 

All vicinal coupling constants of the type SJaNH could be measured in the 


one-dimensional spectra, at different temperatures as required to minimize 
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spectral overlap. None of the coupling constants changed by more than 0.3 Hz 
over the temperature range -5 to 25°C, an indication that the conformation does 
not change significantly. All the observed SJ,njH values are close to the time- 


averaged value of 6.3 Hz for free rotation about the N-C,, bond. 


Temperature Dependence of NH-chemical Shifts 

An amide proton involved in a stable intramolecular hydrogen bond, or 
one inaccessible to solvent for steric reasons, typically shows a reduced 
temperature dependence coefficient in the range 0 to -3 ppb/K?’. That means 
that with an increase in the temperature, the NH proton shifts should move 
toward high field since the signal for the water residue moves to high field. In 
this study, the temperature range for measuring the temperature dependence 
coefficients of the amide protons was between -5 and 25 °C for peptide | and 
between -5 and 40°C for peptide Il. The results showed that the chemical shifts 
for all the amide protons except the amide protons of Cpg8 & Arg9 varied linearly 
with temperature. More surprisingly, the amide proton of Cpg8 shifted to lower 
field initially with an increase in temperature, in the opposite direction to those of 
the shifts of the other NH protons. The temperature coefficient changed from 
+1.4 between -5 and 0 °C to -0.8 between 35 and 40 °C (Figure 6.1). These 
results suggest that at a lower temperature, the NH of Cpg8 is shielded from the 
solvent by the two bulky aliphatic rings and may also exchange with other NH 
protons at lower field. In the case of Arg?, the NH temperature coefficient 
displayed linearity initially at lower temperatures, but showed non-linearity at 
higher temperatures (above 25°C) (Figure 6.1). The temperature coefficients for 
the amide protons of these two peptides are listed in Table 6.2. The abnormal 
temperature dependence coefficient for residue 8 and the non-linearity for 


residue 9 were not observed on peptides III to IX'@ ( X. Liu, J. M. Stewart and G. 


176 


& 
EO Ray, SONS hapr stio 2tnefariog origin: emi to ena 
> TIAABTVGHwO et ter) noteohrm ne Dee ope syns 
ert ert’ ot aB8el0 Site 2ZoulBy BLAS Hovagado sft HA yanastingl ae 
Knod .2-i1 ertiivode Tomer seat 361 Heo teen 
| = wets pals 
iG Ynod sepa wyel 7 Wwostometin! olde is tl bevlovit natorg blir nA — 
anber 6 eworte yilasigyt 2 vation onéts iol NidWoe-oe eidizesaosnl. AO. 
sm tel VSvidqe ¢- of 0 epretiett OU Nsisiiges sonebrsqeb ewieeqmel 
arm | : Putz oot “Pal ; 1 Siege? peal af sesesonl na ritiw tert 
si) ¢ipir’ of eavern subizer istew at Jot tengig ont sonia bleh ripid biswor 
egeh swim ocamet om! onivanent a -iae Syumeqmel ent \yooe | 
‘1 iy &&- bn ayer sy enefoie ebimes em % 2 att c 
rit boworle afiyear eff J ebiiqeq 1h 0s Ons Se Nee 
ait haha Ein ina i) 2NGTO1G Gaon SRIQsone 2g sbime ert 
of bette: Icing te note ky chihs.erh wpmiataias ore 
zor) Of Paiesh aieooge otf Alo wim eainel al eastand geil 
mov! bepnerio insaitisoo ssuieieqaie! ont enonnd 74 1ertro ari to 
(1.8 sugRy O° O& oh C6 nieeuwise6.0-er0" 0 tae e 
ott met beplewis apatite Fi oft sruip rae tet aewol & te Sect 
be SHTIO WAV SOK IS Car yon 208 2H ice ini Giles our oat 


ineisitteos owikeugiel Hii ait} % wu ic eae off mt .btell Tew 


oy 
Je Yinsenii-non bewore tud 2 uisegmef wor as yieiinl 
vot ainsioiiegs suiisiaqmel att (0.8 enugry) (9’as. evods) 2 


isrmonds eit .$.3 sidsi m betel on ms goin 


jor yhssnii+on offi one 8 sublem el Gee ed 
ay i s 
© bas heweitG WU uil .X)S boreal ) eowres 


Kotovych, unpublished results), for which the temperature dependence 
coefficients were obtained between -5 and 25 °C. Based on these observations 
(Table 6.2), the abnormal temperature dependence behavior of peptides | & Il is 
only present for the sequence of DSer6-DCpg’-Cpg8-Arg9, i.e., D6-D7-L8-L9 is 


the required sequence of configurations of the C-terminal amino acids. 


Isotope Exchange 
Figure 6.2 shows the behavior of the NH protons of Arg9, DCpg’ and 


Cpg8 for peptide | in the CD30D/D90 (80/20) solvent. The first spectrum in the 
figure was recorded in CD30H/H90 (80/20) solvent, which displays a small 
difference in chemical shift compared to the rest of the spectra due to the solvent 
isotope effect28. With these three exceptions, the remainder of the NH proton 
resonances disappeared before the first spectrum was recorded 2 minutes after 
the deuterated solvent was added. As can be seen, the isotope exchange rates 
for the three NHs are very slow, considering the size of the peptide. The 
exchange rate for the NH of the Arg? is slower, and the NH of DCpg/ is faster 
than the NH of Cpg8, which displayed the abnormal temperature dependence 
coefficient. It seems that the NHs of the three residues are trapped and not 
exposed to the bulk solvent in the aggregate. These aggregation properties are 
also supported by the CD data where aggregation is already observed for a 
0.150 mM solution (Figure 6.3). Similar behavior was observed for peptide II in 
the isotope exchange experiment, except the NHs of DCpg’, Cpg8, and Arg? 
exchanged more rapidly because of a lower sample concentration. Isotope 
exchange experiments were also carried out on peptides IV & VII, and slow 
exchange rates for the NH protons in these two molecules were not observed. 
The salient feature of the CD spectrum of peptide | (Figure 6.3) is the 


broad positive band extending from 260 nm down to 213 nm with an extremum 
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at 225 nm. This band would not appear to be conformationally related seeing 
that B- and y-turns generally exhibit negative CD bands in this spectral region 
even when the peptide contains D-aminoacids29:30, Although in a few cases the 
band is positive, the extrema are located much further into the blue31,32, On the 
basis of the spectral properties of the model compound B-(2-thienyl)-L-alanine 
methylamide (inset to Figure 6.3), we tentatively assign the band to the thienyl 
moiety of the Thi aminoacid residue in the peptide: the model compound exhibits 
a broad UV absorption band with Amax=234 nm characteristic of thiophenes3 
and , as anticipated, the band is optically active. In particular, note the broad 
system of strongly overlapping CD bands extending from 260 nm to 218 nm. It 
is interesting that the positive band in the peptide spectrum is insensitive to 
concentration, suggesting that the same chiral environment of the thienyl moiety 
is maintained over the designated concentration range. 

In summary, our experimental results, i.e., abnormal temperature 
dependence of the NH protons of Cpg8, the slow exchange rate of the NH 
protons of Arg?, DCpg’ and Cpg8, and the CD spectrum, suggest that 
aggregation exists around the Cpg8 residue in peptides | & Il. This aggregation 
was observed only for peptides with the sequence DSer6-DCpg/-Cpg8-Arg9, as 
only these two peptides displayed the abnormal temperature dependence (Table 
6.1). Our model for this aggregation is that two molecules interact in a head-to- 
tail fashion, with an overlap of residues 6 to 9. In this way, the hydrophilic side- 
chains of Ser® and Arg? point approximately away from the hydrophobic core, 
while the highly hydrophobic side-chains of the two Cpg residues point inwards, 
forming the hydrophobic core. It is in this core that the NH of Cpg8 is trapped 
between two bulky aliphatic rings of residues DCpg’ and Cpg8, which causes 
the inaccessibility of this NH to the solvent. The driving force for this aggregation 


is likely the hydrophobic interaction. Some inter-residue NOEs between the NHs 
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and side-chains were observed for residues 6 to 9, but spectral overlap in the 
aliphatic region, together with aggregation, prevent us from drawing any 
conformational conclusion at the concentrations used for the NMR experiments. 
Aggregation is of concern in all spectroscopic studies of peptides. The 
conclusions that can be drawn about peptide conformational preferences 
depend critically upon whether a structure is intrinsic to the monomer or is 
induced by aggregation?4. Usually, measurement of the concentration 
dependence of the NMR or CD spectra (over the widest possible concentration 
range) is sufficient to identify peptides that have a tendency to associate. In the 
case of the two peptides under study, the concentration of the samples was low, 
but it was the abnormal temperature dependence coefficients that warned us of 
aggregation, which was confirmed further by the exchange and the CD 


measurements. 
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Table 6.1 Peptides Studied. 


| DArg9-Arg1-Pro2-Hyp3-Gly4-Thi2-DSer8-DCpg’-Cpg8-Arg9 
1 DArg9-Arg!-Pro2-Hyp3-Gly4-Phe9-DSer®-DCpg’/-Cpg8-Arg9 


i DArg9-Arg1-Pro2-Hyp3-Gly4-Thi9-Ser8-DCpg’-Cpg8-Arg9 


IV DArg9-Arg!-Pro2-Hyp3-Gly4-Phe9-Ser8-DCpg’-Cpg8-Arg9 

V DArg9-Arg1-Pro2-Hyp3-Gly4-Phe°-Ser8-Cpg’-Cpg8-Arg9 

VI DArg9-Arg!-Pro2-Hyp3-Gly4-Thi9-DSer8-DCpg’-DCpg8-Arg? 
VII DArg®-Arg1-Pro2-Hyp3-Gly4-Thi9-Ser8-DCpg’-DCpg8-Arg? 
VII Arg1-Pro2-Pro3-Gly4-Phe®-DSer§-DCpg’-Phe8-Arg9 
IX Arg1-Pro2-Pro3-Gly4-Phe®-Ser6-DCpg/-Phe8-Arg? 
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Table 6.2 Proton Chemical Shifts?, Coupling Constants and NH 
Temperature Coefficients© of the Peptides Studied4 


Peptide I. 

Peptide NH ae 6 y other protons —_ J(Hz) AT 

Residue (-ppb/K) 

DArg® : 3.92 1.86,1.86 1.62,1.62 6:3.21,3.21 : : 

eNH:7.64 

Arg! 8.96 4.59  1.84,1.84 1.70,1.61 :3.14,3.14 5 : 
(9.00) (4.55) eNH:7.62 

Pro2 : 4.70  2.30,2.30 2.00,1.89 5:3.87,3.50 . : 

Hyp? : 462 222200 4.52 §:3.81,3.81 : 

Gly4 8.91  3.94,3.94 6.0/5.5 7.4 
(8.82) (4.05,3.82) 

Thion 8.40 4.46 —3.30,3.30 5.0 4.8 
(8.30) (4.52) 

DSer§ 8.56 4.24  3.83,3.52 7.0 6.8 
(8.29) (4.43) (3.81,3.75) 

DCpg”_ _— 7.99 4.08 2.28 1.82,1.62,1.53,1.35, 7.5 3.1 
(8.09) (4.24) 1.26 

Cpg® 7.86 4.16 2.20 1.77,1.68,1.61,1.53, 8.0 not linear! 
(8.47) (4.14) 24120 

Arg? 8.03 413  1.84,1.84 1.76,1.61 §:3.17,3.17 7.0 7.4) 
(7.92) (4.13) eNH:7.46 
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Peptide NH a B Y other protons —J(Hz) AT 
che Cee Ue Da es Cppbikyy 
DArg® : 3.90 1.85,1.85 1.60,1.60  8:3.19,3.19 ; ; 
€NH:7.64 
Arg! 8.95 4.59 1.75,1.75  1.57,1.57 5:3.10,3.10 ‘ : 
(8.99) (4.54) eNH:7.68 
Pro@ 3 4.71 2.30,2.30 2.00,1.90 5:3.84,3.48 : : 
Hyp : 462 2181.96 4.53 §:3.80,3.80 : 
Gly4 8.94  3,.96,3.91 5.5/6.0 7.9 
(8.76) (3.99,3.79) 
Phe? 8.44 4.40 3.08,3.03 7.23(2,6) 5.0 5.8 
(8.26) (4.49) 7.32(3,5), 7.26(4) 
DSer® 8.45 416  3.74,3.36 8.0 6.2 
(8.27) (4.43) 
DCpg’ 7.99 4.04 227 1.80, 1.24(2) 7.5 2.9 
(8.08) (4.24) 1.61,1.35(5) 
1.52 
Cpg® 7.77 4.15 2.18 1.77,1.18(2) 8.0 not linear 
(8.47) (4.14) 1.66, 1.32(5) 
1.53 
Arg? 8.05 As] eee Bho ee 70 t 8:3.17,3.17 7.5 7.4) 
(7.91) (4.14) eNH:7.46 
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Chemical shifts are reported in ppm relative to the methyl group of 
CD30H at 3.30 ppm downfield from TMS. 

Coupling constants (in Hz) refer to JNH-q¢ and were measured from one- 
dimensional spectra. 

The temperature coefficients of the amide protons are average values of 
temperature coefficients measured at seven points between -5 to 25 °C 
as discussed in the text. 

The data were recorded at 500 MHz in CD3OH/H90 (80/20 v/v%) at 0 °C. 
The concentrations were 3.2 and 1.1 mmol/L, the pH 5.4 and 5.6 for 
peptides I and Il, respectively. 

Pairs of geminal protons x (x=a, f, y ...) are not assigned 
stereospecifically. When two distinct lines were observed, the larger 
chemical shift was arbitrarily assigned to x and the smaller to x’. 

The numbers in brackets are the chemical shifts for the corresponding L- 
Ser peptide. 

Hyp refers to 4-hydroxy-L-proline. 

Thi refers to B-(2-thienyl)-L-alanine. 

Cpg refers to a-cyclopentyl-glycine. The protons of the aliphatic ring were 
not assigned individually because of spectral overlap. 

The temperature dependence coefficients showed non-linearity. See 


Figure 1 and text for detail. 
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Figure 6.1 


Plot of the chemical shifts of the amide protons of peptide II in 
CD30H/H20(80/20) vs temperature. The concentration was 1.1 
mM and pH 5.4. Identical acquisition and processing parameters 
were used for all spectra. The NH of Cpg8 initially shifts to low field 
and then shifts to high field when the temperature is changed from 
-5 to 40°C, while the other amide protons shift to high field. The 


NH shift of Arg9 is non-linear above 20°C. Details are discussed in 


the text. 
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Figure 6.2 Plot of part of the NH region of peptide I in CD30D/D20(80/20) vs 
time. The concentration was 3.2 mM and pH 5.6. The temperature 
was OC. The spectrum at time O was_ recorded in 
CD30H/H20(80/20). The other individual spectra were recorded 


at the indicated times after sample preparation. 


189 


af 
- — ey 


— oy 


* 


Ve B.¥ 


y (OS\08)Og NIOETO a Pehimjegta noiney Hih athe had te MIT OS 
nvinsanel 6i7 3.2 ho bas Mia$. caw nowetneiaas sit aia | 
ae OD artit: is rantoud’ oT 0 a 


bebicos eyaw: evoege Isubhibni rerio siT (08 ‘f 
} Benes ois ideal votes 


nm  Debole; 


Figure 6.3 Circular dichroism spectrum of peptide I in different concentrations 
in methanol/water (80/20). The pH of the sample was about 4. 
Inset, spectral properties of (}-(2-thienyl)-L-alanine methylamide: a, 
CD spectrum; b, UV absorption spectrum. Similar spectra for B-(2- 


thienyl)-L-alanine. 
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CHAPTER 7 


General Discussion and Conclusions 


The conformational analysis of several small linear peptides, using high- 
field NMR spectroscopy and CD spectra, has been presented in this thesis. 
These conformational analyses are useful for understanding the relationship 
between the conformation and the biological activities, as well as the stabilizing 
factors on the local conformation and the mechanism of the early steps of 
protein folding. 

In recent years, both NMR and CD techniques for protein conformational 
studies have been advancing very rapidly'!:*. This advance is the result of 
hardware development, increasing computer power, biological techniques to 
label proteins with 19N and 13C isotopes, and the nature of protein molecules 
themselves, which display unique and densely packed conformations and yield a 
great deal of information through NOEs, coupling constants and NH exchange 
rates. Now, proteins with 200-300 residues can be readily studied by NMR. 

In principle, new techniques that are applied to proteins can also be 
applied to small peptides. This does not mean that the conformational studies 
for small peptides are easier. Small peptides have far fewer structural restraints 
and every atom in the molecule is usually exposed to the solvent, which means 
that less information is available from NMR experiments. Most importantly, 
peptides are flexible and adopt multiple conformations, the consequence of 
which is that the NMR parameters are time-averaged over each individual 
conformation. Because of these properties of peptides, the general procedure 


for proteins is limited in its application to peptides9. 
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For bioactive peptides, the choice of solvent is an important issue for two 
reasons. First of all, a proper solvent should mimic the environment experienced 
by the ligand in the presence of the receptor. It has been suggested that 
receptor binding sites are hydrophobic. In addition, many peptide hormones bind 
to membrane-bound receptors. This would suggest that solvents with low 
dielectric constants should be the most realistic. However, in apolar solvents, 
Structures with too many internal hydrogen bonds could be stabilized. In terms 
of simulation of an in vivo environment, a micellar solution could be a better 
choice of solvent!9. 

Secondly, the proper solvent can be used to enhance secondary 
structures. At present, almost all studies on proteins are in aqueous solutions 
and some small linear peptides have been found to adopt folded conformations 
in aqueous solutions*. However, even in the cases that the peptides were found 
to adopt some secondary structures, the folded structures are less stable and 
less ideal, and the percentage of the folded structures is usually low4:5. On the 
other hand, the structural stability of marginally stable small peptides and the 
physical interactions accounting for the stability are of current interest in order to 
understand the biological actions of peptides as well as the mechanism of early 
steps of protein folding. Under these circumstances, some less polar solvents 
are being used as stabilizing co-solvents to enhance the secondary structures. 
The most popular one is trifluoroethanol (TFE). This co-solvent simulates the 
lipid environment and reduces the water activity, and may cause the peptide to 
take up its in vivo conformation. In the trifluoroethanol and water mixture, an a- 
helix which is marginally stable in pure water can be stabilized>’. It has been 
shown that, in several peptide systems, TFE does not induce secondary 
structure indiscriminately, but regions of the polypeptide chain that are helical in 


the native state are stabilized preferentially>’”. Recently, the effect of TFE on 
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the structure of intact hen lysozyme in its native and denatured states has been 
Studied. It was found that TFE stabilizes predominantly native-like secondary 
structure in a partially folded state of an intact protein’. This finding is of 
considerable significance and provides support for the idea that studies of 
peptide fragments of proteins in TFE can have direct relevance to understanding 
the structure and stability of protein folding intermediates. 

Little is Known as to why TFE stabilizes a-helices. The dielectric constant 
of TFE is 26.67, about one-third of the value, 78.54, for water at 25°C. 
Interactions between charged species would be expected to be stronger than in 
pure water. However, it was found that the lower dielectric constant of TFE was 
not important in the stabilization of a-helices in a peptide since the magnitude of 
the effect of charged groups on the peptide was not altered significantly by either 
TFE concentration or temperature®. Also compared to water, TFE is a much 
weaker base (pKa; ~ -8.2 for TFE vs ~ -1.8 for water)9 and a stronger acid 
(pKao = 12.4 vs 15.3)9. This means that TFE is stronger at donating protons for 
hydrogen bonds but weaker at accepting protons in hydrogen bonds. The 
effects of solvent acidity and basicity on peptide amides have been probed by 
NMRY. Hydrogen bonding from the peptide amide proton to a solvent acceptor 
results in decreased electronic shielding of the proton, causing a low-field shift 
for the proton. Hydrogen bonding from a solvent donor to the amide carbonyl 
also deshields the proton indirectly. Upon changing the solvent from water to 
TFE, an amide proton exposed to the solvent moves to higher field, because of 
the increased shielding due to reduced hydrogen bonding from the NH to the 
less basic TFE, while a carbonyl exposed to the solvent causes the NH to move 
to low field, because of decreased shielding due to increased hydrogen bonding 
of the more acidic TFE. When both NH and C=O are exposed, the effect of the 


basicity on the NH predominates. Overall, this could increase the ability for the 
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peptide to form intramolecular hydrogen bonds, thus stabilizing the a-helices®. 
Other possible stabilizing factors include lowering the chemical potential of 
water, binding of TFE to the peptide, and changes in the water structure. 

In this thesis, the solvent used in all the studies is a mixture of methanol 
and water (60/40 v/v or 80/20 v/v). The physical properties of methanol are very 
similar to those of TFE. The dielectric constant of pure methanol is 32.63 at 
25°C. This solvent may stabilize B-turns in small peptides!0-12, Based on our 
studies on the peptide DEKS'!, ongoing studies on the FDEKA and YDEKA in 
both water and water/methanol solvents, evidence shows that methanol 
enhances the fi-turn which exists in the pure water. Since the peptides we 
Studied usually involve charged residues and higher concentration of methanol 
(80/20 or 60/40) were used in the studies, interactions between charge species 
seem to be important!!.12, 

It has been suggested that important criteria for a useful structure 
stabilizer are®: (1) the transition from coil to folded structure should be 
cooperative as the stabilizer concentration is increased; (2) the importance of 
one type of interaction, such as the effects of charged groups, should not be 
predominantly amplified; (3) inherently more stable structures should be 
stabilized more than less stable structures; (4) the stabilized structure should be 
the same as that formed in the absence of the stabilizer. According to these 
criteria, the effectiveness of methanol as a stabilizer should be further tested, for 
example, by methanol concentration titration on the peptides which form parts of 
proteins and were found to adopt a higher percentage of (-turns in water!3:14, 

The most difficult aspect of the study of peptides in solution results from 
the flexibility of peptides. Most proteins and peptides are flexible, with a dynamic 
equilibrium between various local energy minima. It is known that 


conformational flexibility is of importance for biological function and for folding 
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processes that proceed through numerous conformations spanning a relatively 
large phase space volume. For peptides, it is hardly possible to find a structure 
consistent with all measured NOEs/ROEs and the holonomic structural data 
(bond lengths and bond angles). It is likely that several dynamically 
interchanging conformations are involved. On the other hand, the finding of a 
structure which fulfills all constraints does not prove the absence of multiple 
conformations!®, Usually, conformational processes with exchange lifetime Te in 
the range 10-4 < Te < 10-1 s can be investigated by line shape studies. Faster 
intramolecular processes may be monitored through relaxation measurements. 
The effects on relaxation are dependent on the ratio Te/Te where Tz is the overall 
molecular tumbling correlation time. For Te/Te >> 1, population-weighted 
conformationally averaged relaxation and cross-relaxation rates are observed. 
For 0.1 < Te/T. < 10, intramolecular dynamics becomes directly relaxation- 
active, while for Te/T. << 1 conformational averaging of the molecular structural 
parameters occurs that depends on the details of the motional process. In 
addition, it is possible to sense, by rotating frame relaxation measurements, 
processes that modulate the chemical shift in the range 10°6 < Te < 1073 16, 
Only in the case where exchange processes are slow compared to the molecular 
tumbling correlation time Tc, i.e. Te/Te >> 1, and where population-averaged 
cross-relaxation rates are observed, can NOEs/ROEs have direct conformational 
implication. 

In many cases, an assumption that one distinct conformation coexists with 
a random coil ensemble is made. Under this assumption, the distinct 
conformation (B-turns and a-helices) with a significant population and some 
characteristic NMR parameters can be detected by a set of criteria3. Among 
these criteria, medium range NOEs (dep (i,i+3) for a-helices, dqn(i,i+2) for B- 


turns) are most important, since they indicate an overall folding within 3 or 4 
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residues. Coupling constants and temperature dependent coefficients are rather 
local indicators within the residue. Usually it takes the combination of unique 
NOEs, coupling constants and temperature dependent coefficients to confirm the 
existence of a secondary structure. 

Quantitation of this population is not so straightforward’. The first 
problem is to define the boundary of the structure. It is not clear how large the 
range of torsion angles of the conformations must be in order to give rise to the 
medium range NOEs. Secondly, there is no way to tell whether all the 
characteristic evidence is cooperative. For example, a series of observed 
dap (i,i+3) NOEs indicate the presence of a a-helix along the peptide chain, but 
this does not mean that the a-helix is present simultaneously all the way along 
the peptide. A small coupling constant indicates that the @ angle is in the a-helix 
or B-turn region, but does not mean that the whole helix or turn is formed. 
Finally, certain information can be common to several conformations. For 
example, a type II B-turn has a strong dqgn(i, i+1) NOE, as does an extended 
conformation. 

Attempts have been made to measure the population percentage of the 
secondary structure in multi-conformer peptides using NMR parameters, making 
the first-order approximations of uniform correlation times, idealized secondary 
structure geometry and isotropic motion (Brockbank, R. L., Ph.D thesis, U. of 
Calgary). The population of the secondary structures by this approach is 
believed to be greater than would be expected given a random distribution 
through the 9-1 torsion space available in a Ramachandran map, 

Recently, a search algorithm was presented by Ernst and his 
coworkers'6. This algorithm allows the determination of multiple conformations 
of biomolecules in solution with exchange rate constants typically between 103 


and 10” s-1 on the basis of experimental high-resolution data. Multiples of 
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Structures are generated which are consistent with the ensemble of NMR cross- 
relaxation rates (NOESY, ROESY), homonuclear and heteronuclear J-coupling 
constants and longitudinal relaxation time T4p measurements. Another 
approach is to apply restrained molecular dynamics (rMD) for dynamic structures 
which has recently been proposed by Torda et al'”. It uses a time-variable NOE 
restrained potential that depends on the past of the trajectory. These 
approaches probably reflect the trend in the conformational studies of small 
peptides in the future, which involves intensive NMR measurements 
(NOEs/ROEs, homonuclear and heteronuclear J-coupling constants, longitudinal 
relaxation time Tp measurements, and their changes with temperatures or field 
strength) and theoretical or empirical calculations (energy minimization and 
restrained molecular dynamics calculations). 

For biologically important peptides, the most important question is 
whether the conformation derived from NMR or other methods is relevant to the 
conformation which is bound to a receptor. Unfortunately, determining 
relationships between structure and activity in small peptides is an elusive goal. 
Most of the biologically active peptides whose activities are well defined and 
measurable are highly flexible molecules. Despite the availability of numerous 
analogs and data on their respective activities, the number of possible 
conformations of these peptides complicates efforts to relate structural 
parameters and activities. Furthermore, the conformation observed in a 
particular environment suitable for physical measurements (most commonly bulk 
solvents) may well not be the receptor-bound conformation. The situation is 
complicated further by the likelihood that a conformational change on the part of 
the peptide molecule is a necessary aspect of its biological action'®:19. 

Nonetheless, a great deal of effort has been invested in studies of 


preferred solution conformations of biologically active peptides. There is, no 
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doubt, a distribution of conformations in solution, and among the various 
conformers it is likely that the conformer that mediates biological activity will be 
found. But no physical method can yield a description of a single conformer 
among a population of conformers that interconvert rapidly on the time scale of 
the measurement. Moreover, there is no a priori strategy that would select the 
bioactive conformation from the population. Until a receptor can be purified and 
isolated and methods can be developed to examine the bound peptide molecule, 
only indirect approaches can be taken to determine the bioactive conformations 
of flexible peptides. 

An approach that has proved useful in elucidating structure - activity 
relationships of bioactive peptides is the development of conformationally 
constrained analogs that retain the biological activity of the native molecule. The 
limited conformational space that is available to these constrained analogs must 
include conformers that interact productively with the receptor!8.19, In fact, the 
conformation of the bound peptide can be studied directly by using the 
transferred NOE experiment under proper conditions®9.21. In a mixture of bound 
peptide with 5-20-fold excess of free peptide, the peptide in the bound state may 
adopt a different conformation from those in the free state. By the exchange 
between the macromolecular binding site and the free state, the NOEs 
characterizing the conformation of the bound molecule can be transferred to 
more easily observable resonances, which correspond either to the spectrum of 
the free molecule modulated by the exchange with the macromolecular binding 
sites, or to the average of the spectra for the bound and free molecules, 
provided that the dissociation rate is fast enough. The problems with this 
method may come from the availability of the receptor and solubility of both the 


peptide and its receptor. An even more elegant way of studying the bound 
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conformation is to isotopically label the peptide and study the complex directly, 
using heteronuclear editing techniques22. 

For the collagen telopeptides which have been studied in this thesis, 
some preliminary transferred NOE experiments have been carried out in our lab, 
but a complete analysis could not be carried out since the collagen triple-helix 
fragment is not soluble enough to give reasonable sensitivity and the mixture is 
also not stable. 

New, more potent bradykinin antagonists have been synthesized 
continuously by Dr. John Stewart. Several new antagonists are listed in Table 
7.1 and their pharmacological data are presented in Table 7.2. It will be 
interesting to study their conformations in the solution, such as methanol/water, 
TFE/water or micelles, to test the hypothesis proposed in Chapter 5 and to 
provide more experimental evidence on activity-conformation relationship of 
bradykinin antagonists. As pointed out in Chapter 5, it is impossible to carry out 
NMR studies when these antagonists are bound to the receptor. These 
antagonists function at nanomolar concentration, far below the limit of NMR 


detection sensitivity. 
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Table 7.1. A List of New Bradykinin Antagonists. 


| DArg9-Arg!-Pro?-Hyp3-Gly4-Cpg°-Ser8-DCpg/-Cpg8-Arg9 
I DArg9-Arg!-Pro2-Hyp3-Gly4-Thi9-Ser8-Digl7-Oic8-Arg9 
III DArg9-Arg!-Pro?-Hyp3-Gly4-Thi2-Ser6-DCpg/-Igl8-Arg9 


IV Dhq-DArg9-Arg!-Pro?-Hyp3-Gly4-Cpg9-Ser8-DCpg’-Cpg8-Arg9 
Abbreviations: Cpg a-cyclopentaneglycine 
Igl a-(2-indane)-glycine 
Oic octahydroindole-2-carboxylic acid 
Dhq dehydroquinuclidine 3-carboxylic acid 


Table 7.2. Pharmacological Data for Peptides I-IV listed in Table 7.1. 


Peptides pAo(Rat uterus) pAo(Guinea pig 
ileum) 


| 7.4 740) 

I 7.8 irreversible 
binding 

III 7.4 lee 

IV TaD 6.3 


DE eee ne 


pAo = negative logarithm of molar concentration of antagonist to give 50% 


inhibition. 
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